Amendment

MARKED UP VERSION SHOWING CHANGES MADE:

Please amend claims 57, 73, 77, 81, 82 and 83 as follows: 

57. (Amended) A method for directing the
biosynthesis of specific [macrolide] polyketide analogs by 
genetic manipulation of a polyketide-producing 
microorganism, said method comprising the steps of: 

(1) isolating a [macrolide] polyketide biosynthetic 
gene-containing DNA sequence;

(2) identifying [enzymatic activities associated]
within said gene-containing DNA sequence, a polyketide
synthase domain encoding for an enzymatic activity;

(3) introducing one or more specified changes into 
said [gene-cotnaining] polyketide synthase domain [DNA 
sequence which codes for one of said enzymatic activities] 
resulting in an altered DNA sequence; 

(4) introducing said altered DNA sequence into a 
polyketide-producing microorganism to replace an [the] 
original sequence;

(5) growing a culture of the altered microorganism 
under conditions suitable for the formation of the specific 
[macrolide] polyketide analog; and 

(6) isolating said specific [macrolide] polyketide 
analog from the culture [, wherein said polyketide 
biosynthetic gene-containing DNA sequence comprises genes 
which encode the enzymatic activities comprising a 
polyketide synthase].

73. (Amended) The method of claim 72 wherein said DNA
sequence is isolated from a genus selected from the group 
[consisitng] consisting of Actinomyces, Dactylosporangium,
Micromonospora, Nocardia, Saccharopolyspora,
Streptoverticillium, and Streptomyces.

77. (Amended) The method of claim 57 wherein said 
polyketide is selected from the group [consiting] consisting 
of macrolides, tetracyclines, polyethers, polyenes, 
ansamycins and derivatives or analogs thereof. 

81. (Amended) The method of claim 57 wherein said DNA 
sequence, designated eryA, encodes a protein having [the] 
enzymatic activities associated with the formation of
6-deoxyerythronolide B.

82. (Amended) The method of claim 57 wherein said
gene-containing DNA sequence encodes one or more proteins
having enzymatic activities in the rapamycin biosynthetic 
pathway. 

83. (Amended) The method of claim [23] 57 wherein said 
polyketide [macrolide] is a rapamycin analog.

REMARKS

Reconsideration of the above-identified application in
view of the following arguments is respectfully requested. 

Claims 57, 73, 77, 81, 82 and 83 have been amended. 
No new matter has been added as a result of these 
amendments.

Restriction Requirement 

Applicants confirm the election of claims 59-62, Groups 
I and II (as the restriction requirement between Groups I 
and II was rescinded) and 57, 58 and 72-83 in part. It is 
respectfully submitted that the claims of Groups I-VI are 
sufficiently closely related as to warrant their examination 
together in the subject application, and that such 
examination will not entail significant additional burden on 
the Examiner. To the extent that the Examiner maintains the 
restriction requirement, Applicants reserve the right to 
file a divisional application to any of the non-elected 
claims.

Claim Objections 

Claims 57, 73 and 77 are objected to as containing 
misspelled words. Claim 83 is objected to as referring to 
cancelled claim 23 rather than to a pending claim. Claims 
57, 73, 77 and 83 have been amended to overcome these 
objections.

Objections to the Specification 

The specification is objected to, and a substitute 
specification is required, because the Office Action states
that the instant specification does not support the subject
matter of the elected claims. The Office Action states that 
the substitute specification must include descriptions of 
the preparation of products with which practice of the 
elected method for biosynthesis of specific polyketides
requiring alteration of a DNA sequence that encodes a
polyketide synthase (PKS), wherein PKS β-ketoreductase,
dehydratase, and/or enoylreductase domains are inactivated
and/or added, is possible, as described in application
serial No. 07/642,734, the priority document which is now 
U.S. Patent No. 5,824,513. 

The present application is a continuation of 08/997,467 
which is now U.S. Patent No. 6,200,813, which is a 
divisional of 08/858,003 now U.S. Patent No. 6,060,234, 
which is a continuation-in-part of 07/642,734, now U.S. 
Patent No. 5,824,513. The Office Action specifically stated 
on page 5 that Figure 3, Examples 1-16 of col. 8-13, 
Examples 27-44 of col. 16-21 and at least col. 4, lines 63- 
66 of U.S. Patent No. 5,824,513 must be added to the 
specification. Accordingly, the Figure and Examples have 
been added to the specification. With respect to the 
information in col. 4, lines 63-66 of U.S. Patent No. 
5,824,513, Applicants respectfully submit that this 
information is generally described on page 19 in the 
substitute specification and thus have not incorporated this 
information into the substitute specification. Applicants 
have, however, incorporated by reference the information in 
column 5, lines 17-20. The Applicants maintain that no new 
matter has been included in the amended specification.

Rejections Under 35 U.S.C. § 112, First Paragraph

Claims 59-62, 72-75, 77, 78, 82 and 83 are rejected 
under 35 U.S.C. Section 112, first paragraph as containing 
subject matter that was not described in the specification 
in such a way as to reasonably convey to one skilled in the 
relevant art that the inventor(s), at the time the
application was filed, had possession of the claimed 
invention. Specifically, the Examiner states that the 
specification fails to describe the design and preparation 
of PKS proteins wherein β-ketoreductase, dehydratase or
enoylreductase domains other than those of the
Saccharopolyspora erythraea eryA-encoded PKS are either
inactivated or added in an altered PKS product by mutation 
of the underlying PKS gene. According to the Examiner, 
"[E]ven if disclosures of the priority document are 
textually incorporated, the instant specification nowhere 
furnishes relevant identifying characteristics of any 
altered PKS but a Saccharopolyspora erythraea eryA-encoded
PKS with which generic methods of claims 57-60, 72-75, 77,
78, 82 and 83 might be practiced. Applicant provides no 
disclosure, or any specific suggestion, of an adequate 
number of diverse, heterogeneous, domain species to 
establish an adequate written description of a generic PKS 
product for the practice of methods of the elected claims 
wherein a PKS is modified by inactivation or addition of 
generic PKS β-ketoreductase, dehydratase, or enoylreductase
domains." Applicants respectfully traverse this rejection. 

The inquiry into whether the description requirement is 
met is determined on a case-by-case basis and is a question
of fact. Section 2163.04 Manual of Patent Examining
Procedure (8th Edition, August 2001). When a question
regarding the adequacy of the written description arises,
the fundamental factual inquiry is whether the specification
conveys to those skilled in the art, as of the filing date 
sought, that Applicant was in possession of the invention 
being claimed. Section 2163.02 Manual of Patent Examining 
Procedure (8th Edition, August 2001). Possession can be
shown in a number of ways. For example, an Applicant can 
show possession by: (1) an actual reduction to practice of 
the claimed invention; (2) a clear depiction of the 
invention in detailed drawings or in structural chemical 
formulas which permit a person skilled in the art to clearly 
recognize that applicant had possession of the claimed 
invention; or (3) any description of sufficient, relevant, 
identifying characteristics so long as a person skilled in 
the art would recognize that the inventor had possession of 
the claimed invention. Section 2163 Manual of Patent 
Examining Procedure (8th Edition, August 2001).

A description as filed is presumed to be adequate 
unless or until sufficient evidence or reasoning to the 
contrary has been presented by the Examiner to rebut the 
presumption. Section 2163.04 Manual of Patent Examining 
Procedure (8th Edition, August 2001). The Examiner,
therefore, must have a reasonable basis to challenge the
adequacy of the written description. Id. The Examiner has
the initial burden of presenting by a preponderance of the
evidence why a person skilled in the art would not recognize
in an applicant's disclosure a description of the invention
as defined by the claims. Id. 

Applicants respectfully submit that the specification 
as filed is adequate and reasonably conveys to one skilled 
in the relevant art that the inventors, at the time the 
application was filed, had possession of the claimed
invention. As admitted by the Examiner, the instant
specification describes the design and preparation of PKS
proteins wherein the β-ketoreductase, dehydratase or
enoylreductase domains of Saccharopolyspora erythraea
eryA-encoded PKS are either inactivated or added in an altered
PKS product by mutation of an underlying PKS gene.
Applicants submit that, based upon this description in the
specification, those skilled in the art would be
readily able to prepare such PKS proteins in
microorganisms (hosts) other than Saccharopolyspora
erythraea. Other suitable microorganisms (or host(s)) that
can be used to produce such PKS products are described on
pages 3-4 and 10 (see the definition of "polyketide-producing
microorganism") of the enclosed substitute
specification. As discussed in column 24 lines 6-32 of U.S. 
Patent 5,824,513 (which is the priority document for the 
above-identified application), the choice of a convenient
host is based solely on the relatedness of the novel 
polyketide to a natural counterpart in order to minimize the 
number of module rearrangements required for biosynthesis. 
Specifically, Type I, Type II and Type III alterations can 
be constructed in other microorganisms employing either 
endogenous or exogenous modules to produce novel polyketides 
employing strategies analogous to those described in the 
examples for Saccharopolyspora erythraea. Applicants have 
amended the enclosed substitute specification to include 
this specific text from column 24, lines 6-32 of U.S. Patent 
5,824,513.

Thereupon, Applicants submit that the Examiner has not 
met his burden of presenting by a preponderance of the 
evidence why a person skilled in the art would not recognize 
in an applicant's disclosure a description of the invention
as defined by the claims. Therefore, Applicants submit that
this rejection should be withdrawn. 

Claims 59-62, 72-75, 77, 78, 82 and 83 are rejected 
under 35 U.S.C. Section 112, first paragraph as not being 
enabled by the specification. The Examiner states that the 
specification is not enabling for any method of directing 
the biosynthesis of generic macrolides, tetracyclines, 
polyethers, polyenes, ansamycins, or any analogs or 
derivatives thereof, utilizing a modified PKS comprising
the addition of one or more generic β-ketoreductase,
dehydratase, or enoylreductase domains. Applicants 
respectfully traverse this rejection. 

The test of enablement is whether one reasonably 
skilled in the art could make or use the invention from the 
disclosures in the patent application coupled with 
information known in the art without undue experimentation 
(Manual of Patent Examining Procedure, 8th Edition, August
2001).

U.S. Patent 5,824,513 (which is the priority document 
for the above-identified application) describes in detail
the organization of the eryA gene and the DEBS protein encoded
by said gene (See Figs. 1-2). Additionally, the
polynucleotide and amino acid sequences of the various
domains of eryA are described in this patent as well (See
SEQ ID NOS:1-3).

As discussed on page 20 of the enclosed copy of the
substitute specification, polyketide
synthases, their domains and functionally similar
domains can be identified using routine techniques in the art,
such as nucleic acid hybridization under conditions of low
or high stringency. These techniques allow for the
identification of any polyketide synthase domain, such as
β-ketoreductase, dehydratase, or enoylreductase, as well as
their identification in organisms other than Sac. erythraea.
(As discussed previously, the substitute specification
describes many polyketide-producing organisms that can be
used in the present invention.) Once these domains have
been identified, they can be isolated or synthesized for 
insertion into a polyketide biosynthetic gene-containing DNA 
sequence using routine techniques in the art. Such an 
altered DNA sequence, having an additional polyketide 
synthase domain, would, when expressed, exhibit one or more 
additional enzymatic activities. 

Applicants submit that the Examiner has not provided any
evidence that one reasonably skilled in the art could not,
using the disclosure in the enclosed substitute
specification, make the claimed invention without undue
experimentation. Thereupon, in view of the aforementioned
arguments, Applicants submit that this rejection should be 
withdrawn. 

Rejections Under 35 U.S.C. § 112, Second Paragraph 

Claims 57-62, 72-83 are rejected under 35 U.S.C. § 112, 
second paragraph, because clauses 2, 3, and 5 of claim 57 
and claims 81 and 82 are construed by the Examiner as 
referring to enzymatic activity associated with a DNA 
sequence, rather than referring to a DNA sequence that 
encodes for an enzyme having activity. Also, claim 57 is 
rejected because "the original sequence" in clause 4 has no
antecedent basis. Claims 57, 77, 81, 82 and 83 have been
amended in response to these rejections. 

Conclusion 

For the foregoing reasons, independent amended claim 57 
and dependent claims 58-62 and 72-83 are in condition for 
allowance. Therefore, reconsideration and allowance are 
requested. 

Should the Examiner have any questions concerning the 
above, she is respectfully requested to contact the 
undersigned at the telephone number listed below. If any 
additional fees are incurred as a result of the filing of 
this paper, authorization is given to charge deposit account 
no. 01-0025. 




Respectfully submitted, 
J. Katz, et al.



23492 




ABBOTT LABORATORIES 
Telephone: (847) 938-3137 
Facsimile: (847) 938-2623 



Dianne Casuto 
Registration No. 40,943 
Attorney for Applicants

MARKED UP VERSION OF SPECIFICATION SHOWING CHANGES MADE:

NOVEL POLYKETIDE DERIVATIVES AND RECOMBINANT METHODS FOR MAKING SAME

This application is a continuation application of U.S. Serial No. 08/997,467, filed 
December 23, 1997, now [allowed] U.S. Patent No. 6,200,813, which is a divisional of U.S. Serial
Number 08/858,003, filed May 16, 1997, now U.S. Patent No. 6,060,234, which is a
continuation-in-part of U.S. Serial No. 08/642,734, filed January 17, 1991, now abandoned, all of which are
incorporated herein by reference in their entirety. 



Technical Field 



The present invention relates to novel polynucleotide sequences, proteins encoded
therefrom which are involved in the biosynthesis of polyketides, methods for directing the 
biosynthesis of novel polyketides using those polynucleotide sequences and novel derivatives 
produced therefrom. In particular, the invention relates to the production of novel polyketide 
derivatives through manipulation of the genes encoding polyketide synthases. 



Background of the Invention 



Polyketides are a large class of natural products that includes many important antibiotic, 
antifungal, anticancer, antihelminthic, and immunosuppressant compounds such as erythromycins, 
tetracyclines, amphotericins, daunorubicins, avermectins, and rapamycins. Their synthesis 
proceeds by an ordered condensation of acyl esters to generate carbon chains of varying length 
and substitution pattern that are later converted to mature polyketides. This process has long been 
recognized as resembling fatty acid biosynthesis, but with important differences. Unlike a fatty 
acid synthase, a typical polyketide synthase is programmed to make many choices during carbon 
chain assembly: for example, the choice of "starter" and "extender" units, which are often 
selected from acetate, propionate or butyrate residues in a defined sequence by the polyketide 
synthase. The choice of using a full cycle of reduction-dehydration-reduction after some 
condensation steps, omitting it completely, or using one of two incomplete cycles (reduction alone 
or reduction followed by dehydration) is additionally programmed, and determines the pattern of
keto or hydroxyl groups and the degree of saturation at different points in the chain. Finally, the
stereochemistry for the substituents at many of the carbon atoms is programmed by the polyketide 
synthase. 

Streptomyces and the closely related Saccharopolyspora genera are producers of a 
prodigious diversity of polyketide metabolites. Because of the commercial significance of these 
compounds, a great amount of effort has been expended in the study of Streptomyces and 
Saccharopolyspora genetics. Consequently, much is known about these organisms and several 
cloning vectors and techniques exist for their transformation. 

Although many polyketides have been identified, there remains the need to obtain novel 
polyketide structures with enhanced properties. Current methods of obtaining such molecules 
include screening of natural isolates and chemical modification of existing polyketides, both of 
which are costly and time consuming. Current screening methods are based on gross properties of
the molecules, i.e., antibacterial, antifungal activity, etc., and both a priori knowledge of the
structure of the molecules obtained and predetermination of enhanced properties are virtually
impossible. Chemical modification of preexisting structures has been successfully employed to 
obtain novel polyketides, but still suffers from practical limitations to the type of compounds 
obtainable, largely connected to the poor yield of multistep synthesis and available chemistry to 
effect modifications. Modifications which are particularly difficult to achieve are those involving 
additions or deletions of carbon side chains. Accordingly, there exists a considerable need to 
obtain molecules wherein such changes can be specified and performed in a cost effective manner 
and with high yield. 

The present invention solves these problems by providing reagents (specifically,
polynucleotides, vectors comprising the polynucleotides and host cells comprising the vectors) 
and methods to generate novel polyketides by de novo biosynthesis rather than by chemical 
modification. 

Summary of the Invention 



In one aspect, the present invention provides compounds of the formula:

wherein R1, R2, R3, R4, R5, and R6 are independently selected from Q wherein Q is selected from
the group consisting of (a) -H, (b) -Me, (c) -Et, and (d) -OH; L1 and L2 are independently -H or
-OH; L3 is D-desosamine or -OH; and L4 is L-mycarose, L-cladinose or -OH with the proviso that
when R1-R5 are -Me, R6 is other than -H or -Me. Preferred compounds of the invention are
those in which Q is selected from the group consisting of (a), (b) and (c) above or (a), (b) and (d) above
or (a), (c) and (d) above or (b), (c) and (d) above or (a) and (b) above or (a) and (c) above or (a)
and (d) above or (b) and (c) above or (c) and (d) above and L1, L2, L3 and L4 are as defined above.
Other preferred compounds include those in which R1, R2, R3, R4, R5, and R6 are all -H or -Et or
-OH and L1, L2, L3 and L4 are as defined above. Still other preferred compounds include didesmethyl,
tridesmethyl, tetradesmethyl, pentadesmethyl and hexadesmethyl derivatives of the compounds of
formula.

In another aspect, the invention provides host cells transformed with a vector as described 
above. The host cell may be a bacterial cell and preferably is selected from the group consisting 
of E. coli and Bacillus species. Alternatively, the host cell is a polyketide-producing 
microorganism. A preferred polyketide-producing host cell is selected from the group consisting 
of Saccharopolyspora species, Nocardia species, Micromonospora species, Arthrobacter 
species, Streptomyces species, Actinomadura species, and Dactylosporangium species. An even
more preferred polyketide-producing host cell is selected from the group consisting of
Saccharopolyspora hirsuta, Micromonospora rosaria, Micromonospora megalomicea,
Streptomyces antibioticus, Streptomyces mycarofaciens, Streptomyces tsukubaensis, Streptomyces
fradiae, Streptomyces platensis, Streptomyces violaceoniger, Streptomyces ambofaciens,
Streptomyces griseoplanus, and Streptomyces venezuelae. Of these host cells, Saccharopolyspora
erythraea, Streptomyces hygroscopicus, Streptomyces venezuelae, and Streptomyces caelestis are 
most preferred. 

The invention also provides a method for altering the substrate specificity of a polyketide 
synthase in a first polyketide-producing microorganism comprising the steps of (a) isolating a first 
and second genomic DNA segment, each comprising a polyketide synthase wherein the first 
genomic DNA segment is from the first polyketide-producing microorganism and the second 
genomic DNA segment is from the first polyketide-producing microorganism or a second 
polyketide-producing microorganism; 

(b) identifying one or more discrete fragments of the first genomic DNA segment, each 
of which encodes an acyltransferase domain; 

(c) identifying one or more discrete fragments of the second genomic DNA segment, 
each of which encodes a related domain to the acyltransferase domain of the first genomic DNA 
segment; and 

(d) transforming a cell of the first polyketide-producing microorganism with one or 
more of the fragments from step (c) under conditions suitable for the occurrence of a homologous 
recombination event, leading to the replacement of one or more of the fragments from the first 
genomic DNA segment with one or more of the fragments from step (c). In one embodiment, the 
first polyketide-producing microorganism is Saccharopolyspora erythraea and the second 
polyketide-producing microorganism is Streptomyces. Preferred Streptomyces are selected from 
the group consisting of Streptomyces antibioticus, Streptomyces mycarofaciens, Streptomyces
avermitilis, Streptomyces hygroscopicus, Streptomyces caelestis, Streptomyces tsukubaensis,
Streptomyces fradiae, Streptomyces platensis, Streptomyces violaceoniger, Streptomyces
ambofaciens, and Streptomyces venezuelae. Even more preferred Streptomyces are Streptomyces
caelestis, Streptomyces hygroscopicus, or Streptomyces venezuelae. In a second embodiment, the 
first polyketide-producing microorganism is a Streptomyces as described above and the second 
polyketide-producing microorganism is Saccharopolyspora erythraea. Also in a preferred 
embodiment, the related domain is selected from the group consisting of SEQ ID NO: 31, SEQ ID 
NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34. 



Brief Description of the Drawings 
The present invention will be more readily appreciated in connection with the
accompanying drawings.

FIG. 1 is a proposed metabolic pathway for the biosynthesis of erythromycin A in Sac.
erythraea.

FIG. 2 is a schematic representation of the erythromycin PKS. 

FIG. 3 is a Growtree analysis of AT domains from Streptomyces hygroscopicus (S.
hygroscopicus; LigAT2 and rapAT1-14), Streptomyces venezuelae (S. venezuelae; venAT) and
Saccharopolyspora erythraea (Sac. erythraea; eryAT1-6).

FIG. 4a is a schematic representation of gene replacements of EryAT1 with LigAT2 or
venAT and EryAT2 with LigAT2 in Sac. erythraea.

FIG. 4b is a schematic representation of gene replacements of EryAT4 with an ethyl AT 
(NidAT5) in Sac. erythraea. 

FIG. 5 is a diagrammatic representation of gene replacement by homologous 
recombination. 

FIG. 6 is a schematic representation of the genetic organization of the Ligase-PKS cluster 
from S. hygroscopicus ATCC 29253. 

FIG. 7 represents the nucleotide sequence (SEQ ID NO: 1, top strand) and corresponding 
amino acid sequence (SEQ ID NO: 31, bottom strand) of LigAT, the malonyl AT domain from 
module 2 of the Ligase-PKS cluster of S. hygroscopicus ATCC 29253. 

FIG. 8 is a diagrammatic representation of the strategy to clone the LigAT2 domain. 

FIG. 9 is a flow diagram depicting the cloning of the EryAT1 flanking regions in plasmid
pCS5.

FIG. 10 is a flow diagram depicting construction of pEryAT1/LigAT2.

FIG. 11 is a computer generated PhosphorImage of a Southern analysis of chromosomal
DNA from Sac. erythraea ER720 EryAT1/LigAT2 resolvants cut with SphI and probed with an
approximately 3 kb EcoRI/HindIII fragment from pEryAT1/LigAT2. As shown in lanes 3, 4 and
7 the probe hybridized with fragments of 3.5 and 1.6 kb, indicating that LigAT2 had replaced
EryAT1 in the chromosomes of these resolvants (clone #10, #11 and #24 respectively). Lanes 5
and 6: chromosomal DNA from Sac. erythraea ER720 resolvants to wild-type (wt); lanes 1 and 9:
molecular weight markers (1 kb ladder).

FIG. 12 is a computer reproduction of a TLC plate on which the products produced by Sac.
erythraea ER720 EryAT1/LigAT2 were run. Lanes 1 and 7: erythromycin A standard (5 µg);
lanes 2 and 6: compounds produced by wild-type Sac. erythraea ER720; lanes 3, 4 and 5:
compounds produced by Sac. erythraea ER720 EryAT1/LigAT2 resolvants to mutant-type,
clones #10, #11 and #24 respectively.

FIG. 13 is a flow diagram depicting the cloning of the EryAT2 flanking regions in plasmid
pCS5.

FIG. 14 is a flow diagram depicting construction of pEryAT2/LigAT2. 

FIG. 15 is a computer generated PhosphorImage of a Southern analysis of chromosomal
DNA from Sac. erythraea ER720 EryAT2/LigAT2, cut with SphI and probed with an
approximately 1 kb LigAT2 sequence. As seen in lane 3, an approximately 900 base pair 
fragment hybridized with the probe, indicating that LigAT2 had replaced EryAT2 in this 
resolvant. Lane 2: chromosomal DNA from wild-type (wt) Sac. erythraea ER720; lanes 1 and 5: 
molecular weight markers (1 kb ladder). 

FIG. 16 is a computer reproduction of a TLC plate on which the products produced by Sac. 
erythraea ER720 EryAT2/LigAT2 were run. Lanes 1 and 6: erythromycin B standard (5 µg);
lanes 2 and 5: erythromycin A standard (5 µg); lane 4: compounds produced by wild-type Sac.
erythraea ER720; lane 3: compounds produced by Sac. erythraea ER720 EryAT2/LigAT2 
resolvant, clone #2-4. 

FIG. 17 is a computer reproduction of a Xerox image of a bioautography plate of products 
made by Sac. erythraea ER720 EryAT2/LigAT2 against S. aureus. Lanes 1 and 7: erythromycin 
B standard (1 µg); lanes 2 and 6: erythromycin A standard; lane 3: compounds produced by
wild-type Sac. erythraea ER720; lane 4: extract from a 0.1 mL culture of Sac. erythraea ER720
EryAT2/LigAT2 resolvant clone #2-4; lane 5: extract from a 0.5 mL culture of Sac. erythraea
ER720 EryAT2/LigAT2 resolvant clone #2-4. 

FIG. 18 represents the nucleotide sequence (SEQ ID NO: 2, top strand) and corresponding 
amino acid sequence (SEQ ID NO: 32, bottom strand) of venAT, the malonate AT domain from 
the PKS cluster (hereinafter designated pven4) from S. venezuelae ATCC 15439. 

FIG. 19 is a diagrammatic representation of the strategy to clone the venAT domain.

FIG. 20 is a flow diagram depicting construction of pEryAT1/venAT.

FIG. 21 is a computer generated PhosphorImage of a Southern analysis of chromosomal
DNA from Sac. erythraea ER720 EryAT1/venAT resolvants, cut with PvuII and probed with a
venAT sequence. As seen in lanes 4 and 5, the probe hybridized with fragments of 4.2 and 2.4 kb,
indicating that venAT had replaced EryAT1 in these resolvants. Lane 1: molecular weight
markers (1 kb ladder); lane 2: chromosomal DNA from a wild-type Sac. erythraea ER720; lane 3:
chromosomal DNA from a Sac. erythraea ER720 EryAT1/venAT integrant; lane 4: chromosomal
DNA from Sac. erythraea ER720 EryAT1/venAT resolvant clone #C.1; lane 5: chromosomal
DNA from Sac. erythraea ER720 EryAT1/venAT resolvant clone #C.4.

FIG. 22 is a computer reproduction of a TLC plate on which the products produced by Sac.
erythraea ER720 EryAT1/venAT were run. Lane 1: erythromycin A standard (EryA; 5 µg) and
3-α-mycarosylerythronolide B (MEB; 10 µg); lane 2: compounds produced by Sac. erythraea
ER720 EryAT1/venAT resolvant clone #C.4.

FIG. 23 is a diagrammatic representation of the strategy to clone the rapAT14 domain. 

FIG. 24 is a flow diagram depicting construction of pEryAT1/rapAT14.

FIG. 25 is a computer generated PhosphorImage of a Southern analysis of chromosomal
DNA from a Sac. erythraea ER720 EryAT1/rapAT14 resolvant, cut with StyI and probed with an
EcoRI-HindIII fragment from pCS5AT1-flank. As shown in lane 2 the probe hybridized with a
1.6 kb fragment indicating that rapAT14 had replaced EryAT1 in the chromosome of this
resolvant. Lane 1: molecular weight markers (1 kb ladder); lane 2: chromosomal DNA from Sac.
erythraea ER720 EryAT1/rapAT14 resolvant clone #4-A(1); lane 3: chromosomal DNA from
wild-type Sac. erythraea ER720. 

FIG. 26 is a computer reproduction of a TLC plate on which the products produced by Sac. 
erythraea ER720 EryAT1/rapAT14 were run. Lane 1: erythromycin A standard (5 µg); lane 2:
compounds produced by Sac. erythraea ER720 EryAT1/rapAT14 resolvant.

FIG. 27 is a flow diagram depicting construction of pEryAT2/rapAT14. 

FIG. 28 is a computer generated PhosphorImage of a Southern analysis of chromosomal
DNA from Sac. erythraea ER720 EryAT2/rapAT14 resolvants, cut with BspEI and probed with a
fragment of the 5'-flanking region of eryAT2. As shown in lanes 5, 6 and 7, the probe hybridized
with a 4.3 kb fragment, indicating that rapAT14 had replaced EryAT2 in the chromosomes of
these resolvants. Lane 1: molecular weight markers (1 kb ladder); lane 2: chromosomal DNA
from wild-type Sac. erythraea ER720; lane 3: chromosomal DNA from Sac. erythraea ER720
resolvant to wild-type, clone #1.1; lane 4: chromosomal DNA from a Sac. erythraea ER720 
EryAT2/rapAT14 integrant; lanes 5-7: chromosomal DNA from Sac. erythraea ER720 
EryAT2/rapAT14 resolvant clones #1.2, #1.3 and #1.4 respectively. 

FIG. 29 is a computer reproduction of a TLC plate on which the products produced by Sac. 
erythraea ER720 EryAT2/rapAT14 were run. Lane 1: erythromycin A and erythronolide B (EryA 
and EB, respectively; 5 µg each); lane 2: compounds produced by wild-type Sac. erythraea
ER720; lanes 3-5: compounds produced by Sac. erythraea ER720 EryAT2/rapAT14 resolvant 
clones #1.2, #1.3 and #1.4 respectively. 

FIG. 30 is a computer reproduction of a bioassay of compounds made by Sac. erythraea 
ER720 EryAT2/rapAT14 resolvant clones #1.2, #1.3 and #1.4 and resolvant to wild-type clone 
#1.1. 

FIG. 31 is a computer generated PhosphorImage of a Southern analysis of a cosmid DNA
library constructed from Streptomyces caelestis NRRL-2821 chromosomal DNA. Lanes 1-19:
DNA prepared from 19 clones, digested with SstI and probed with a Streptomyces caelestis
NRRL-2821 PKS specific probe. 

FIG. 32 is a schematic representation of the genetic organization of the PKS cluster from 
Streptomyces caelestis NRRL-2821. 

FIG. 33 is a diagram of the structure of the macrolide ring of niddamycin. 

FIG. 34 represents the nucleotide sequence (SEQ ID NO:29, top strand) and corresponding 
amino acid sequence (SEQ ID NO:33, bottom strand) of NidAT5, the ethyl AT domain from 
module 5 of the PKS cluster of Streptomyces caelestis NRRL-2821. 

FIG. 35 is a flow diagram depicting the construction of pUC/ethAT/C-6. 

FIG. 36 is a diagram showing the nucleotide changes made to create an AvrII site at the 5' 
end of NidAT5. 

FIG. 37 is a diagram of the replacement plasmid pEAT4. 

FIG. 38 is a computer generated PhosphorImage of a Southern analysis of chromosomal 
DNA from Sac. erythraea ER720 EAT4 resolvants digested with MluI and probed with a 900 bp 
DNA fragment spanning a KS/AT domain in Streptomyces caelestis NRRL-2821. Lane 
assignments are as follows: 1) wild type ER720; 2-7) resolvant clones. The resolvants with the 
NidAT5 domain in place of EryAT4 produced a strongly hybridizing 1.8 kb fragment (lanes 4, 5, 
and 7) which is missing in clones which resolved back to wild type (lanes 2, 3, and 6). 

FIG. 39 is a computer reproduction of a TLC plate showing the products made by Sac. 
erythraea EAT4-46 after growing in SCM or SCM + 50 mM butyric acid. 

FIG. 40 is a computer generated PhosphorImage of a Southern analysis of clones from a 
cosmid DNA library constructed from Streptomyces caelestis NRRL-2821 chromosomal DNA. 
Clones were digested with SmaI and probed with a 900 bp DNA fragment spanning a KS/AT 
domain in Streptomyces caelestis NRRL-2821. 

FIG. 41 represents the nucleotide sequence (SEQ ID NO:30, top strand) and corresponding 
amino acid sequence (SEQ ID NO:34, bottom strand) of NidAT6, the AT domain in module 6 of 
the niddamycin PKS cluster. 

FIG. 42 is a diagrammatic representation of the strategy to clone the NidAT6 domain. 

FIG. 43 is a flow diagram depicting construction of pEryAT2/NidAT6. 

FIG. 44 is a schematic representation of Type I, Type II and Type III changes in eryA and 
structures of corresponding novel polyketides produced. Δ69 (Type I) and Δ33 (Type II) 
represent in-frame deletions of the base pairs in the DNA segments corresponding to the KR of 
module 2 and the 3-ketoacyl ACP synthase of module 2, respectively. Insertion of a complete 
copy of module 4 within module 1 is also shown. Production of 11-epifluoro-15-norerythromycin 
in a strain that carries Δ33 occurs when the substrate analog (2S, 3S, 4S, 5S) 
2,4-dimethyl-3-fluoro-5-hydroxyhexanoic acid-ethyl thioester is fed. 

DETAILED DESCRIPTION OF THE INVENTION 

I. Definitions: 

For the purposes of the present invention as disclosed and claimed herein, the following 
terms are defined: 

The term "polyketide" as used herein refers to a large and diverse class of natural products 
including but not limited to antibiotic, anticancer, antihelminthic, antifungal, pigment, and 
immunosuppressant compounds. Antibiotics include but are not limited to anthracyclines, 
tetracyclines, polyethers, polyenes, ansamycins, and macrolides of various types such as 
avermectins, erythromycins, and niddamycins. The term polyketide is also intended to refer to 
compounds of this class that can be used as intermediates in chemical syntheses. For example, 
erythromycin A is a polyketide that is isolated and used in the synthesis of the antibiotic 
clarithromycin. Polyketides used as intermediates do not themselves necessarily have any 
biological or therapeutic activity. 

The term "polyketide-producing microorganism" as used herein includes but is not limited 
to bacteria from the order Actinomycetales, Myxococcales or other Eubacteriales that can produce 
a polyketide. Examples of actinomycetes and myxobacteria that produce polyketides include but 
are not limited to Saccharopolyspora erythraea, Saccharopolyspora hirsuta, Micromonospora 
rosaria, Micromonospora megalomicea, Sorangium cellulosum, Streptomyces antibioticus, 
Streptomyces mycarofaciens, Streptomyces avermitilis, Streptomyces hygroscopicus, Streptomyces 
caelestis, Streptomyces tsukubaensis, Streptomyces fradiae, Streptomyces platensis, Streptomyces 
violaceoniger, Streptomyces ambofaciens, Streptomyces venezuelae and various other 
Streptomyces, Actinomadura, Dactylosporangium and Amycolatopsis strains that produce 
polyketides. Yeast and fungi that produce polyketides are also considered "polyketide-producing 
microorganisms". Examples of fungi that produce polyketides include but are not limited to 
members of the genus Aspergillus. 

The term "polyketide synthase" (PKS) as used herein refers to a complex of enzyme 
activities responsible for the biosynthesis of polyketides. The enzymatic activities contained 
within a PKS include but are not limited to β-ketoreductase (KR), dehydratase (DH), 
enoylreductase (ER), β-ketoacyl ACP synthase (KS), acyl carrier protein (ACP), acyltransferase 
(AT) and thioesterase (TE). The polypeptide fragment responsible for each enzymatic activity is 
referred to as a "domain". A "module" refers to a group or set of domains which carry out one 
condensation step in the process of polyketide formation and may or may not include domains 
which effect processing of the β-carbonyl group in the growing polyketide. 

The term "Type I PKS" as used herein refers to a PKS which is a large multifunctional 
protein and is exemplified by DEBS (see below). The term "Type II PKS" refers to a PKS having 
several separate, largely monofunctional enzymes, and is exemplified by the PKSs responsible 
for the biosynthesis of actinorhodin and tetracenomycin (C.R. Hutchinson and I. Fujii, Annu. Rev. 
Microbiol. 49:201-238 (1995)). 

The term "cognate domains" as used herein refers to the members of a specific set of 
domains which constitute a naturally occurring single module. 

The term "related domain" or "heterologous domain" as used herein refers to a PKS 
domain which is functionally similar to a second PKS domain. By "functionally similar" it is 
meant that each domain catalyzes a particular type of reaction but acts upon a different substrate. 
For example, the AT domain of module 1 of Sac. erythraea (eryAT1) and the AT domain of 
module 14 of S. hygroscopicus (rapAT14) both catalyze the transfer of an extender unit to a 
corresponding ACP domain. In the case of Sac. erythraea, however, eryAT1 utilizes 
methylmalonyl CoA as a substrate whereas in S. hygroscopicus, rapAT14 utilizes malonyl CoA. 
Thus, eryAT1 and rapAT14 are considered to be "related" or "heterologous" domains. 

The term "condensation" as used herein refers to the addition of an extender unit to the 
nascent polyketide chain and requires the action of KS, AT and ACP domains of the PKS. 

The term "starter" as used herein refers to a coenzyme A thioester of a carboxylic acid 
which is used by a polyketide synthase as the first building block of the polyketide. 

The term "extender" as used herein refers to a coenzyme A thioester of a dicarboxylic acid 
that is incorporated into a polyketide by a polyketide synthase at positions other than the first 
position. 

The term "DEBS" as used herein refers to the enzyme 6-deoxyerythronolide B synthase, 
the PKS that builds the polyketide-derived macrolactone 6-deoxyerythronolide B (6-DEB). 

The term "eryA" as used herein refers to the genes which encode the DEBS. 

The term "homologous recombination" as used herein refers to crossing over between 
DNA strands containing identical sequences. 

The term "isolated" as used herein means that the material is removed from its original 
environment (e.g. the natural environment where the material is naturally occurring). For 
example, a naturally occurring polynucleotide or polypeptide present in a living animal is not 
isolated, but the same polynucleotide or polypeptide, which is separated from some or all of the 
coexisting materials in the natural system, is isolated. Such polynucleotides could be part of a 
vector and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that the vector or composition is not part of the natural environment. 

The term "restriction fragment" as used herein refers to any linear DNA generated by the 
action of one or more restriction enzymes. 



The term "replicon" as used herein means any genetic element, such as a plasmid, 
chromosome or virus, that behaves as an autonomous unit of polynucleotide replication within a 
cell. A "vector" is a replicon in which another polynucleotide fragment is attached, such as to 
bring about the replication and/or expression of the attached fragment. 

The terms "recombinant polynucleotide" and "recombinant polypeptide" as used herein 
mean at least a polynucleotide or polypeptide which by virtue of its origin or manipulation is not 
associated with all or a portion of the polynucleotide or polypeptide with which it is associated in 
nature and/or is linked to a polynucleotide or polypeptide other than that to which it is linked in 
nature. 

The term "host cell" as used herein, refers to both prokaryotic and eukaryotic cells which 
are used as recipients of the recombinant polynucleotides and vectors provided herein. 

The term "open reading frame" or "ORF" as used herein refers to a region of a 
polynucleotide sequence which encodes a polypeptide; this region may represent a portion of a 
coding sequence or a total coding sequence. 

II. The Invention 

In its broadest sense, the present invention entails novel polyketides with therapeutic 
activity (e.g. antimicrobial, anticancer, antifungal, immunosuppressant and/or antihelminthic 
activity) and intermediate compounds of such polyketides. The invention also provides a method 
for producing novel polyketides in vivo by selectively altering the genetic information of an 
organism that naturally produces a polyketide. The present invention further provides isolated and 
purified polynucleotides that encode PKS domains (i.e. polypeptides) from polyketide-producing 
microorganisms, fragments thereof, vectors containing those polynucleotides, and host cells 
transformed with those vectors. These polynucleotides, fragments thereof, and vectors comprising the polynucleotides 
can be used as reagents in the above described method. Portions of the polynucleotide sequences 
disclosed herein are also useful as primers for the amplification of DNA or as probes to identify 
related domains from other polyketide-producing microorganisms. 

III. Polynucleotides 

The present invention provides isolated and purified polynucleotides that encode PKS 
domains (i.e. polypeptides) and fragments thereof which are involved in the production of 
polyketides. Polynucleotides included within the scope of the invention may be in the form of 
RNA or DNA, including cDNA, genomic DNA and synthetic DNA. The DNA may be double-stranded or 
single-stranded, and if single-stranded may be the coding (sense) strand or non-coding (anti-sense) 
strand. The coding sequence which encodes a polypeptide may be identical to a coding sequence 
provided herein or may be a different coding sequence which, as a result of the redundancy or 
degeneracy of the genetic code, encodes the same polypeptide as the DNA provided herein. 
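
By way of illustration only, the degeneracy referred to above can be sketched in a few lines of Python. The two coding sequences and the helper name `translate` are hypothetical and are not taken from the sequences provided herein; the only fixed input is the standard genetic code.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding (sense) strand, codon by codon, into one-letter amino acids."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Two hypothetical coding sequences that differ at several positions.
SEQ_A = "ATGGCTCTGTCCCGT"
SEQ_B = "ATGGCGTTAAGCAGA"

if __name__ == "__main__":
    print(translate(SEQ_A), translate(SEQ_B))
```

Both sequences encode the same pentapeptide although they differ at the nucleotide level, which is the sense in which a "different coding sequence" may encode the same polypeptide.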

Polynucleotides may include only the coding sequence for a particular polypeptide or for a 
polypeptide which is functionally equivalent to the polypeptide sequences provided herein. 
Additionally, the invention includes variant polynucleotides containing modifications such as 
polynucleotide deletions, substitutions or additions; and any polypeptide modification resulting 
from the variant polynucleotide sequence. A polynucleotide of the present invention also may 
have a coding sequence which is a naturally occurring allelic variant of the coding sequence 
provided herein. 

Probes and primers constructed according to the polynucleotide sequences provided herein 
are also contemplated as within the scope of the present invention and can be used in various 
methods to provide various types of analysis. For example, primer sequences may be designed 
according to polynucleotide sequences which encode particular domains and then used to amplify 
polynucleotide sequences of the same or other related domains using well-known amplification 
techniques such as the polymerase chain reaction (PCR) and the ligase chain reaction (LCR). 
(PCR has been disclosed in U.S. patents 4,683,195 and 4,683,202, and LCR, in EP-A-320 308 to 
K. Backman published June 16, 1989 and EP-A-439 182 to K. Backman et al., published July 31, 
1991, all of which are incorporated herein by reference). Generation of primers for use in other 
amplification techniques or in variations of these amplification techniques (such as nested PCR) 
is also contemplated within the scope of the invention and is considered within the knowledge of 
the routine practitioner. 
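
As a rough sketch of the primer design step described above, the following Python fragment derives a forward and a reverse primer from the ends of a target region and estimates a melting temperature for each. The template sequence, the 18-nucleotide primer length and the function names are hypothetical. The melting temperature estimate uses the Wallace rule of thumb (2 °C per A or T, 4 °C per G or C), which is an assumption of this sketch and not a method disclosed herein.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def wallace_tm(primer):
    """Approximate melting temperature (deg C) of a short oligonucleotide (Wallace rule)."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def primer_pair(template, length=18):
    """Forward primer = 5' end of the top strand; reverse primer anneals to its 3' end."""
    template = template.upper()
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical target region, for illustration only.
    target = "ATGGCTCTGTCCCGTGGAACCTTCGACGGCATCGCCGTGACCTAG"
    fwd, rev = primer_pair(target)
    print(fwd, wallace_tm(fwd))
    print(rev, wallace_tm(rev))
```

In practice a primer design tool would also check for self-complementarity and for matched melting temperatures; those checks are omitted here.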

Probes and primers may be designed from conserved nucleotide regions of a 
polynucleotide of interest or from non-conserved nucleotide regions of a polynucleotide of 
interest. Generally, nucleic acid probes are developed from non-conserved or unique regions 
when maximum specificity is desired, and nucleic acid probes are developed from conserved 
regions when assaying for nucleotide regions of related members of a multigene family or in 
related species. Probes can also be labeled with radioisotopes or other detection labels for 
screening of recombinant libraries. 

Various methods for synthesizing primers and probes are well-known in the art as are 
methods for attaching labels to primers or probes. For example, it is a matter of routine to 
synthesize primers or probes using phosphoramidite chemistry and instruments available from 
Applied Biosystems, Inc. (Foster City, CA), Dupont (Wilmington, DE), or Milligen (Bedford, MA). 
Many methods have been described 
for labeling oligonucleotides such as the primers or probes of the present invention. 
Commercially available probe labeling kits include those from Amersham Life Science (Arlington 
Heights, IL), Promega (Madison, WI), Enzo Biochemical (New York, NY) and Clontech (Palo 
Alto, CA). 

IV. Vectors and Host Cells 

The present invention provides vectors which include polynucleotides of the present 
invention and host cells which are genetically engineered with vectors of the present invention. 

a. Vectors and Expression Systems 

The present invention includes recombinant constructs comprising one or more of the 
sequences as broadly described above. The constructs comprise a vector, such as a plasmid or 
viral vector, into which a sequence of the invention has been inserted, in a forward or reverse 
orientation. Such vectors include chromosomal, nonchromosomal and synthetic DNA sequences 
from prokaryotic or eukaryotic sources. Large numbers of suitable plasmids and vectors are 
known to those of skill in the art, and are commercially available. Vectors which are particularly 
useful for cloning and expression in intermediate hosts include but are not limited to: (a) Bacterial: 
pBR322 (ATCC 37017); pGEM (Promega Biotec, Madison, WI), pUC, pSPORT1 and pProEx1 (Life 
Technologies, Gaithersburg, MD); pQE70, pQE60, pQE-9 (Qiagen); pBs, phagescript, psiX174, 
pBluescript SK, pBsKS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene®, La Jolla, CA); 
pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5, and pGEX4T (Pharmacia®, 
Piscataway, NJ); and (b) Eukaryotic: pWLneo, pSV2cat, pOG44, pXT1, pSG (Stratagene®); 
pSVK3, pBPV, pMSG, pSVL (Pharmacia®); pcDNA3.1 (Invitrogen, Carlsbad, CA). Other 
appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Maniatis, et al., Molecular Cloning: A Laboratory Manual , Second Edition, (Cold 
Spring Harbor Press, N.Y., 1982), which is hereby incorporated by reference. Generally however, 
any plasmid or vector may be used as long as it is replicable and viable in a host. 

In another embodiment, the construct is an expression vector which also comprises 
regulatory sequences operably linked to the sequence of interest, to direct mRNA synthesis and 
polypeptide production. Regulatory sequences known to operate in prokaryotic and/or eukaryotic 
cells include inducible and non-inducible promoters for regulating mRNA transcription, ribosome 
binding sites for translation initiation, stop codons for translation termination and transcription 
terminators and/or polyadenylation signals. In addition, an expression vector may include 
appropriate sequences for amplifying expression. 

Promoter regions may be selected from any desired gene. Particular named bacterial 
promoters include lacZ, gpt, lambda PR, lambda PL, trc, trp, ermE and its derivatives such as 
ermEP1 ΔTGG, also known in the art as ermE* (Bibb, M.J., et al., Molecular Microbiology, 
14(3): 533-545 (1994)), melC1, and actII (C.M. Kao, et al., Science, 265: 509-512 (1994)). 
Eukaryotic promoters include cytomegalovirus (CMV) immediate early, herpes simplex virus 
(HSV) thymidine kinase, early and late SV40, LTRs from retroviruses, mouse metallothionein-I, 
prion protein and neuronal specific enolase (NSE). Selection of the appropriate promoter is well 
within the level of ordinary skill in the art. In addition, a recombinant expression vector will 
include an origin of replication and selectable marker (such as a gene conferring resistance to an 
antibiotic (e.g. neomycin, chloramphenicol, ampicillin, or thiostrepton) or a reporter gene (e.g. 
luciferase)) which permit selection of stably transformed or transfected host cells. 

In any expression vector, a heterologous structural sequence (i.e. a polynucleotide of the 
present invention) is assembled in appropriate phase with translation initiation and termination 
sequences. Optionally, the heterologous sequence will encode a fusion protein including an N- 
terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

Eukaryotic expression vectors will also generally comprise an origin of replication, a 
suitable promoter operably linked to a sequence of interest and also any necessary translation 
enhancing sequence, polyadenylation site, transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, and polyadenylation sites may be used to provide the 
required genetic elements. Such vectors may also include an enhancer sequence to increase 
transcription of a gene. Enhancers are cis-acting elements of DNA, usually from about 10 to 300 
bp, that act on a promoter to increase its transcription rate. Examples include the SV40 enhancer 
on the late side of the replication origin (bp 100 to 270), a cytomegalovirus early promoter 
enhancer, a polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. 

i. Vector construction 

The appropriate DNA sequence may be inserted into a vector by a variety of procedures. 
Generally, site-specific DNA cleavage is performed by treating the DNA with suitable restriction 
enzymes under conditions which are generally specified by the manufacturer of these 
commercially available enzymes. Usually, about 1 microgram (µg) of plasmid or DNA sequence 
is cleaved by 1 unit of enzyme in about 20 microliters (µl) of buffer solution by incubation at 
37°C for 1 to 2 hours. After incubation with the restriction enzyme, protein can be removed by 
phenol/chloroform extraction and the DNA recovered by precipitation with ethanol. The cleaved 
fragments may be separated using polyacrylamide or agarose gel electrophoresis, according to 
methods known by the routine practitioner. (See Maniatis et al., supra). 
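
For illustration, the fragment sizes expected on a gel after such a digestion can be predicted in silico. The Python sketch below assumes a linear molecule and considers the top strand only; the test sequence and the function name `digest` are hypothetical. The recognition sequences and cut positions listed for AvrII, MluI and SmaI are the generally published ones and should be confirmed against the enzyme supplier's documentation.

```python
# Recognition sequence and cut offset (bases from the start of the site, top strand).
SITES = {
    "AvrII": ("CCTAGG", 1),  # C^CTAGG
    "MluI": ("ACGCGT", 1),   # A^CGCGT
    "SmaI": ("CCCGGG", 3),   # CCC^GGG (blunt)
}


def digest(seq, enzymes):
    """Return the fragment lengths, in 5'-to-3' order, from digesting a linear DNA."""
    seq = seq.upper()
    cuts = set()
    for name in enzymes:
        site, offset = SITES[name]
        start = seq.find(site)
        while start != -1:
            cuts.add(start + offset)
            start = seq.find(site, start + 1)
    bounds = [0] + sorted(cuts) + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Hypothetical 26-bp linear fragment with one AvrII site and one SmaI site.
    example = "AAAA" + "CCTAGG" + "TTTTTTTT" + "CCCGGG" + "AA"
    print(digest(example, ["AvrII"]))
    print(digest(example, ["AvrII", "SmaI"]))
```

A circular plasmid would need the first and last fragments joined; that case is left out to keep the sketch short.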

Ligations are performed using standard buffer and temperature conditions and with a ligase 
(such as T4 DNA ligase) and ATP. Sticky end ligations require less ATP and less ligase than 
blunt end ligations. Vector fragments may be treated with bacterial alkaline phosphatase (BAP) or 
calf intestinal alkaline phosphatase (CIAP) to remove the 5'-phosphate and thus prevent religation 
of the vector. Ligation mixtures are transformed into suitable cloning hosts such as E. coli and 
successful transformants selected by methods including antibiotic resistance, and then screened for 
the correct construct. 

ii. Transformation/Transfection 

Transformation or transfection of an appropriate host with a construct of the invention, 
such that the host produces recombinant polypeptides, may also be performed in a variety of ways. 
For example, a construct may be introduced into a host cell by calcium chloride or polyethylene 
glycol transformation, lithium chloride or calcium phosphate transfection, DEAE-Dextran 
mediated transfection, or electroporation. These and other methods for transforming/transfecting 
host cells are well known to routine practitioners (see L. Davis et al., "Basic Methods in 
Molecular Biology", 2nd edition, Appleton and Lange, Paramount Publishing, East Norwalk, CT 
(1994) and D.A. Hopwood et al., Genetic Manipulation of Streptomyces: a laboratory manual, The 
John Innes Foundation, Norwich, England (1985)). 



b. Host Cells 

In one embodiment, the present invention provides host cells containing recombinant 
constructs as described below. In one aspect, a host cell may be an "intermediate" host which is 
used to produce polynucleotides of the invention on a large-scale basis (for the purpose of cloning 
and/or verifying recombinant polynucleotide sequences, for example) or as a means to maintain 
such polynucleotide sequences over time (i.e. as maintenance or storage strains). A "production" 
host is a host cell which is used to produce novel polyketides. The host cell (either intermediate or 
production) can be a higher eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, 
such as a yeast cell, or a prokaryotic cell, such as a bacterial cell. Lower eukaryotic and 
prokaryotic cells are preferred intermediate and production hosts. 

Representative examples of appropriate hosts include bacterial cells, such as E. coli, 
Bacillus subtilis, Saccharopolyspora erythraea, Streptomyces caelestis, Streptomyces 
hygroscopicus, Streptomyces venezuelae; and various other species within the genera 
Arthrobacter, Micromonospora, Nocardia, Pseudomonas, Streptomyces, Staphylococcus, and 
Saccharopolyspora, although others (of eukaryotic origin) may also be employed. Additional 
representative examples of host cells are polyketide-producing microorganisms (as defined 
above). The selection of an appropriate host is deemed to be within the scope of those skilled in 
the art from the teachings provided herein. 

Host cells are genetically engineered (transduced, transformed, transfected, conjugated, or 
electroporated) with the vectors of this invention which may be a cloning vector or an expression 
vector. The engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants, or as a source of a biosynthetic 
substrate. The culture conditions, such as temperature, pH and the like, are those previously used 
with the host cell selected for expression, and will be apparent to the ordinarily skilled artisan. 

V. Novel Polyketides and Methods of Making Novel Polyketides 

The invention also provides novel polyketides, intermediate compounds thereof, and 
methods for producing novel polyketides. The methods utilize the polyketide biosynthetic genes 
from Sac. erythraea (i.e. the eryA genes) as well as those from other known polyketide-producing 
microorganisms and/or putative polyketide-producing microorganisms (i.e. those having 
nucleotide sequences which hybridize to known PKS sequences but whose polyketide products are 
unknown). 

The organization of eryA and the DEBS encoded therefrom (see FIG. 1 and FIG. 2) have 
been described in co-pending U.S. application Serial No. 07/642,734, filed January 17, 1991, 
which is incorporated herein by reference in its entirety. As FIG. 2 shows, DEBS is organized in 
modules, with each module being responsible for one condensation step through the action of the 
resident KS, AT and ACP domains within that module wherein an extender unit, methylmalonyl 
CoA, is added first to the starter unit, propionyl CoA, and then successively to the growing acyl 
chain. The precise succession of the elongation steps is dictated by the order of the six modules: 
module 1 determines the first condensation; module 2, the second; module 3, the third, and so on 
until the sixth condensation step has occurred. In addition, the choice of extender unit that is 
incorporated into a growing polyketide chain at each condensation is determined, in whole or in 
part, by the AT domain within each module. In the case of DEBS, the extender unit incorporated 
is always methylmalonate. Thus, as 6-deoxyerythronolide B grows through successive 
condensations, two carbons are added to the nascent chain and every other carbon, starting with 
the carbon corresponding to C-12 in the ring, carries a methyl group as a side chain. 

As also seen in FIG. 2, the processing of the growing carbon chain after each condensation 
is determined by the information within each module. Thus, β-ketoreduction of the β-keto group 
generated by the condensation event takes place after each condensation step except the third, as 
determined by the presence of an active KR domain in each module except module 3, whereas 
dehydration and enoylreduction take place after the fourth condensation step, as determined by the 
presence of the DH and ER domains in module 4. Once the polyketide chain is fully synthesized, 
it is released from the PKS through the action of the TE domain present at the end of module 6 
and cyclizes to form the macrocyclic lactone 6-deoxyerythronolide B which is subsequently acted 
upon by a series of other enzymes, whose genes reside in the erythromycin cluster of the Sac. 
erythraea chromosome (see FIG. 1). As shown in FIG. 1, erythromycin carries methyl side chains 
at positions 2, 4, 6, 8, 10 and 12, through the incorporation of methylmalonate as the extender unit 
at each step of synthesis of the polyketide moiety. 
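
The modular logic just described lends itself to a simple data model. The following Python sketch is illustrative only: the class and function names are hypothetical, and the carbon numbering (module 1 contributing C-12 and C-13 of the ring, module 6 contributing C-2 and C-3) is inferred from the positions stated above rather than taken from FIG. 2. The state of each β-carbon is given as it leaves the module, before cyclization.

```python
from dataclasses import dataclass

METHYLMALONYL = "methylmalonyl-CoA"


@dataclass(frozen=True)
class Module:
    number: int
    extender: str          # extender unit selected by the module's AT domain
    reductive: frozenset   # active beta-carbon processing domains: KR, DH, ER


# DEBS as described above: KR in every module except 3; DH and ER only in module 4.
DEBS = [
    Module(1, METHYLMALONYL, frozenset({"KR"})),
    Module(2, METHYLMALONYL, frozenset({"KR"})),
    Module(3, METHYLMALONYL, frozenset()),
    Module(4, METHYLMALONYL, frozenset({"KR", "DH", "ER"})),
    Module(5, METHYLMALONYL, frozenset({"KR"})),
    Module(6, METHYLMALONYL, frozenset({"KR"})),
]


def beta_carbon_state(module):
    """Oxidation state of the beta-carbon left behind by a module."""
    if "KR" not in module.reductive:
        return "keto"
    if "DH" not in module.reductive:
        return "hydroxyl"
    if "ER" not in module.reductive:
        return "double bond"
    return "methylene"


def ring_positions(pks):
    """Map each module to the ring carbons it contributes (assumed numbering)."""
    total = len(pks)
    rows = []
    for module in pks:
        alpha = 2 * (total - module.number) + 2   # carbon carrying the side chain
        rows.append({
            "module": module.number,
            "alpha_carbon": alpha,
            "methyl": module.extender == METHYLMALONYL,
            "beta_carbon": alpha + 1,
            "beta_state": beta_carbon_state(module),
        })
    return rows


if __name__ == "__main__":
    for row in ring_positions(DEBS):
        print(row)
```

Under these assumptions the model places methyl groups at C-12, C-10, C-8, C-6, C-4 and C-2, a keto group at C-9 (module 3) and a fully reduced methylene at C-7 (module 4), in agreement with the description above.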

In the present invention, novel polyketide molecules of a desired structure are produced by 
introducing specific genetic alterations into a PKS-encoding sequence in the genome of a 
polyketide-producing microorganism. Alteration of one or more genes or fragments thereof may 
be generated through manipulation of genes residing exclusively within a species (i.e. intraspecies 
alterations), and include not only manipulations of genes within a single PKS cluster but also 
between different PKS clusters residing within a single strain (as is seen in S. hygroscopicus). 
Several examples of intraspecies alterations showing the manipulation of genes exclusively within 
a single PKS (namely, eryA) are described in U.S. application Serial No. 07/642,734, cited supra. 
Alternatively, a gene or fragment thereof may be exchanged with a heterologous gene or gene 
fragment encoding one or more related domains from the PKS of a different polyketide-producing 
microorganism (interspecies alterations). Several examples of novel polyketides produced from 
exchange of heterologous genes are provided herein. 

Whether the genetic manipulations are performed intraspecies or interspecies, three types 
of alterations to a PKS sequence may be carried out: (i) those which affect a module but do not 
cause the arrest of chain growth (Type I alterations); (ii) those which affect a single function in a 
module thereby causing the arrest of chain growth (Type II alterations); and (iii) those which 
affect an entire module (Type III alterations). In one embodiment, Type I alterations are produced 
by inactivation of domains that specify the functional groups and/or degree of oxidation found at 
specific ring positions in the native polyketide. Such domains typically include β-ketoreductases, 
dehydratases and enoylreductases. For example, an allele corresponding to the β-ketoreductase of 
module 5 may be mutated by deleting a substantial portion of the DNA encoding the 
β-ketoreductase (thereby producing an inactive domain) and used to replace the wild-type allele in 
the native strain. Such a transfer results in the production of the novel polyketide 
5-oxo-5,6-dideoxy-3-α-mycarosyl erythronolide B. 

In an alternative embodiment, Type I alterations are generated by replacing at least one 
domain in a particular PKS with at least one related domain from the same or a second PKS. Such 
related domains may exist between different polyketide-producing microorganisms (such as for 
example, the AT domains of Sac. erythraea, S. venezuelae, S. hygroscopicus, and S. caelestis) or 
within a single species (as for example, the LigAT2 and rapAT1 domains in S. hygroscopicus). 
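
The effect of exchanging one AT domain for a related domain can be sketched as follows. This Python fragment is a simplified illustration and not a description of any construct provided herein: the dictionary and function names are hypothetical, the substrate assignments for eryAT1 through eryAT6 and rapAT14 follow the definitions given above, and the predicted loss of the C-12 methyl group when rapAT14 occupies module 1 is an inference from those definitions (using an assumed numbering in which module 1 supplies C-12).

```python
# Extender unit selected by each AT domain, per the definitions above.
AT_SUBSTRATE = {f"eryAT{n}": "methylmalonyl-CoA" for n in range(1, 7)}
AT_SUBSTRATE["rapAT14"] = "malonyl-CoA"

# Side chain left on the alpha-carbon by each extender unit.
SIDE_CHAIN = {"methylmalonyl-CoA": "methyl", "malonyl-CoA": "hydrogen"}


def swap_at(at_domains, module, new_domain):
    """Return a copy of the AT domain list with one module's AT domain replaced."""
    swapped = list(at_domains)
    swapped[module - 1] = new_domain
    return swapped


def side_chains(at_domains):
    """Map ring position to side chain (assumed: module 1 -> C-12, last module -> C-2)."""
    total = len(at_domains)
    return {
        2 * (total - index): SIDE_CHAIN[AT_SUBSTRATE[domain]]
        for index, domain in enumerate(at_domains)
    }


if __name__ == "__main__":
    wild_type = [f"eryAT{n}" for n in range(1, 7)]
    hybrid = swap_at(wild_type, module=1, new_domain="rapAT14")
    print(side_chains(wild_type))
    print(side_chains(hybrid))
```

The sketch treats the AT domain as the sole determinant of the extender unit, whereas the description above states only that the choice is determined "in whole or in part" by the AT domain.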

A schematic representation of some examples of Type I, Type II and Type III alterations in 
eryA and the corresponding novel polyketides produced in hosts that carry such alterations is 
shown in FIG. 44. 

It will also occur to those skilled in the art that genetic manipulations described herein 
need not be limited to Sac. erythraea. Suitable hosts are any other polyketide-producing 
Actinomyces where DNA can be precisely inserted into the chromosome. Hence, the choice of a 
convenient host is based solely on the relatedness of the novel polyketide to a natural counterpart 
so as to minimize the number of module rearrangements required for its biosynthesis. Therefore, 
Type I, Type II and Type III alterations can be constructed in other Actinomyces employing either 
endogenous or exogenous modules to produce novel polyketides employing strategies analogous 
to those described herein for Sac. erythraea. Thus all Type I, Type II or Type III mutations or 
various combinations thereof constructed in other actinomycetes according to the principles 
described herein, and the respective polyketides produced from such strains, are included within 
the scope of the present invention. Examples of polyketides that can be altered by creating Type I, 
Type II or Type III changes in the producing microorganisms include, but are not limited to 
macrolide antibiotics such as erythromycin, tylosin, spiramycin, etc.; ansamacrolides such as 
rifamycins, maytansines, etc.; polyketide antibiotics such as tetracycline; polyethers such as 
monensin, salinomycin, etc.; polyenes such as candicidin, amphotericins; immunosuppressants 
such as FK506, ascomycin, rapamycin, etc. and other complex polyketides such as avermectin. 

Ways to identify polyketide synthases, their domains and the functional similarity of 
domains are well-known to those of ordinary skill in the art. For example, the PKS region of the 
chromosome of a polyketide or putative polyketide-producing microorganism may be identified 
by hybridizing with nucleic acid probes under conditions of low or high stringency. Hybridization 
under high stringency conditions is generally performed in a buffer consisting of 15 mM sodium 
chloride and 1.5 mM trisodium citrate (0.1 x SSC) with an incubation temperature of about 65°C 
(see for example, Maniatis, et al. supra). To detect more distantly related PKS genes, 
hybridization is performed under low stringency conditions which include lower temperature 
incubations and/or the presence of increased amounts of sodium chloride and trisodium citrate 
(Maniatis, et al. supra). Once identified, the chromosomal region may be isolated, cloned into a 
suitable vector and sequenced, using conventional methods or commercial sequencing kits such as 
Sequenase (US Biochemical Corp, Cleveland, OH). Methods for isolating and cloning 
chromosomal DNA are also well known in the art (Maniatis, et al. supra). An amino acid 
sequence may then be deduced from the DNA sequence and a comparison made of the unknown 
amino acid sequence to that of one or more polypeptides involved in polyketide biosynthesis. 
Two amino acid sequences showing at least about 20% and more preferably about 25% identity 
and having conserved active site residues or motifs are considered to specify functionally similar 
or equivalent PKS domains. Having identified such domains, the number and composition of 
modules as well as the arrangement of modules within particular ORFs can be determined. 
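
By way of illustration only, the salt concentrations quoted above for high stringency hybridization follow from simple dilution arithmetic. The sketch below assumes the conventional 1 x SSC composition of 150 mM sodium chloride and 15 mM trisodium citrate, which is consistent with the 0.1 x SSC figures given above. The function name and the 2 x SSC value, used here purely as an example of an increased salt concentration, are hypothetical.

```python
# Assumed composition of 1 x SSC in millimolar; consistent with the
# 0.1 x SSC values (15 mM and 1.5 mM) stated in the text.
SSC_1X_MM = {"sodium chloride": 150.0, "trisodium citrate": 15.0}


def ssc_concentrations(fold):
    """Return the mM concentration of each salt in a `fold` x SSC buffer."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    # Rounding removes floating-point noise from the multiplication.
    return {salt: round(mm * fold, 6) for salt, mm in SSC_1X_MM.items()}


high_stringency = ssc_concentrations(0.1)   # buffer described in the text
example_low = ssc_concentrations(2.0)       # hypothetical higher-salt buffer
print(high_stringency)
print(example_low)
```

Lower stringency corresponds to a larger `fold` value and/or a lower incubation temperature, as stated above.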

In the case where the newly defined PKS produces a polyketide of known structure, the 
β-carbonyl processing and types of side chain moieties and their positioning on the polyketide 
backbone can be correlated to specific domains within modules. Because modules are established 
linearly within ORFs, this correlation also allows one to determine the order of modular activity 
(i.e. which module catalyzes which condensation step) in the PKS. For example, the β-carbonyl 
processing and types of side chain moieties in the polyketide generate a pattern of chemical 
groups that can be correlated to a pattern of domains within an ORF. Based on the specific type of 
side chain moiety at a given carbon, one can then predict the particular substrate utilized by that 
module's AT domain. 

In the case where the polyketide structure is unknown, theoretically, comparative sequence 
analysis alone may be used to predict the substrate specificity of an AT domain. To accomplish 
this, at least two and preferably, three or more sequences known or predicted to specify a 
particular substrate can be compared to determine one or more conserved or consensus motifs 
unique to that family of ATs. An unknown AT having such motifs can then be assigned to a 
particular family. 
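
The motif-based assignment described above may be sketched as follows. This is a minimal illustration that assumes the sequences have already been aligned to equal length; the short strings used are toy examples, not actual AT domain sequences, and the function names are hypothetical.

```python
def conserved_positions(aligned):
    """Map column index -> residue for columns identical in every sequence."""
    if len(aligned) < 2:
        raise ValueError("need at least two aligned sequences")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be pre-aligned to equal length")
    return {
        i: column[0]
        for i, column in enumerate(zip(*aligned))
        if column[0] != "-" and len(set(column)) == 1
    }


def matches_motif(candidate, motif):
    """True if the aligned candidate carries every conserved residue."""
    return all(i < len(candidate) and candidate[i] == res
               for i, res in motif.items())


# Toy family of three aligned fragments (hypothetical data).
family = ["GHSQGE", "GHSLGE", "GHSVGE"]
motif = conserved_positions(family)
print(motif)
print(matches_motif("GHSAGE", motif), matches_motif("GASAGE", motif))
```

In practice a motif is only useful for assignment if it is also absent from ATs of the other families, which this sketch does not check.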

Alternatively, comparative analyses can be performed using computer programs which 
group AT domains based on primary amino acid sequence similarity or phylogenetic relationships. 
For example, comparative analyses were made of the amino acid sequences of the AT domains in 
DEBS with corresponding AT domains in the PKS for rapamycin to determine whether the 
extender unit used by a particular AT domain (either malonate or methylmalonate) correlated 
with the degree of sequence identity between these domains. Rapamycin is a large polyketide that 
is assembled through 14 condensation events; the rapamycin PKS possesses 14 AT domains 
whose sequences were deduced from known nucleotide sequences (Aparicio et al. Gene 169:9-16 
(1996)). Amino acid sequence comparisons of the 14 AT domains of the rapamycin PKS with 
each other and with the 6 AT domains from DEBS showed that the AT domains fell into two 
distinct groupings in which the rapamycin AT domains from modules 1, 3, 4, 6, 7, 10 and 13 
clustered with the 6 erythromycin AT domains and the rapamycin AT domains in modules 2, 5, 
8, 9, 11, 12 and 14 formed a separate cluster (Haydock et al. FEBS Letts. 374:246-248 (1995)). 
Examination of the polyketide structure of rapamycin indicated that methyl side chains were at 
positions on the lactone ring corresponding to condensation steps 1, 3, 4, 6, 7, 10 and 13, which 
suggested that methylmalonate was used as the extender unit during synthesis of these sections of 
the acyl chain; protons at the positions of the lactone ring corresponding to condensation steps 2, 
5, 8, 9, 11, 12 and 14 suggested that malonate was utilized as the extender unit during synthesis of 
these sections. Two additional AT domains described herein, ligAT2 and venAT, were also found 
to cluster with the putative malonate AT domains from the rapamycin PKS (FIG. 3). Having 
predicted that AT domains from rap modules 2, 5, 8, 9, 11, 12 or 14, as well as ligAT2 and 
venAT, specify malonate as extender units, the DNA encoding such domains could be isolated, 
cloned and used to replace the DNA encoding one or more AT domains in a PKS such as DEBS, 
in order to generate novel polyketides. 
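
The correlation between cluster membership and extender unit described above can be restated compactly. The following sketch simply encodes the two module groupings given in the text for the rapamycin PKS; the function names are hypothetical and nothing beyond the stated groupings is assumed.

```python
# Module groupings for the rapamycin PKS, as listed in the text.
METHYLMALONATE_MODULES = {1, 3, 4, 6, 7, 10, 13}   # cluster with the DEBS ATs
MALONATE_MODULES = {2, 5, 8, 9, 11, 12, 14}        # separate cluster


def predicted_extender(module):
    """Extender unit predicted for the AT domain of a rapamycin module."""
    if module in METHYLMALONATE_MODULES:
        return "methylmalonate"
    if module in MALONATE_MODULES:
        return "malonate"
    raise ValueError("the rapamycin PKS has modules 1 through 14")


def predicted_side_chain(module):
    """Substituent expected at the ring position built by this module."""
    return "methyl" if predicted_extender(module) == "methylmalonate" else "H"


for m in range(1, 15):
    print(m, predicted_extender(m), predicted_side_chain(m))
```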

The techniques for determining the amino acid sequence "similarity" are well-known in 
the art. In general, when two or more polypeptides are aligned with one another, their sequence 
similarity refers to the amino acids at corresponding positions within each polypeptide sequence 
that are identical or possess similar chemical and/or physical properties such as charge or 
hydrophobicity. A so-termed "percent similarity" then can be determined between the compared 
polypeptide sequences. In general, the term "identity" refers to an exact nucleotide to nucleotide 
or amino acid to amino acid correspondence at a given position of two polynucleotides or 
polypeptide sequences, respectively. Two amino acid sequences (or for that matter, two or more 
polynucleotide sequences) can be compared by determining their "percent identity." The 
programs available in the Wisconsin Sequence Analysis Package, Version 8 (available from 
Genetics Computer Group (GCG), Madison, WI), for example, the GAP program, are capable of 
calculating both the identity between two polynucleotides and the identity and similarity between 
two polypeptide sequences, respectively. Other programs for calculating and displaying 
similarity between sequences are known in the art. For example, the Growtree program (GCG, 
Madison, WI) creates a phylogenetic tree wherein the most closely related sequences are clustered 
and joined by the shortest lines. This tree is derived from a matrix created by the program 
Distances (GCG, Madison, WI) which calculates pairwise relationships within a group of aligned 
sequences. 
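
To make the distinction between "identity" and "similarity" concrete, a minimal sketch follows. It is not the GAP, Distances or Growtree algorithm: it assumes pre-aligned sequences of equal length, skips gapped columns, and uses a simplified, hypothetical grouping of residues by charge or hydrophobicity rather than a scoring matrix. All names are illustrative.

```python
# Hypothetical residue groupings by broad chemical property.
SIMILAR_GROUPS = [set("DE"), set("KRH"), set("AVLIM"),
                  set("FWY"), set("ST"), set("NQ")]


def _aligned_pairs(a, b):
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns where either sequence has a gap.
    return [(x, y) for x, y in zip(a.upper(), b.upper())
            if x != "-" and y != "-"]


def _similar(x, y):
    return x == y or any(x in g and y in g for g in SIMILAR_GROUPS)


def percent_identity(a, b):
    pairs = _aligned_pairs(a, b)
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def percent_similarity(a, b):
    pairs = _aligned_pairs(a, b)
    if not pairs:
        return 0.0
    return 100.0 * sum(_similar(x, y) for x, y in pairs) / len(pairs)


def distance_matrix(seqs):
    """Pairwise distances (100 - percent identity) for named sequences."""
    return {(p, q): 100.0 - percent_identity(seqs[p], seqs[q])
            for p in seqs for q in seqs}


toy = {"a": "ACDKL", "b": "ACEKV"}   # toy aligned fragments
print(percent_identity(toy["a"], toy["b"]))
print(percent_similarity(toy["a"], toy["b"]))
print(distance_matrix(toy)[("a", "b")])
```

A matrix of this kind is the sort of input from which a tree-building program clusters the most closely related sequences.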

In a preferred embodiment, novel polyketide molecules of desired structure are produced 
by the replacement of at least one AT domain-encoding fragment of DNA of the Sac. erythraea 
chromosome with at least one heterologous AT domain-encoding fragment of DNA from another 
PKS cluster to yield novel polyketide compounds which are derivatives of 6-deoxyerythronolide 
B, erythronolide B, 3-α-L-mycarosylerythronolide B, or erythromycins A, B, C and D. Such 
derivatives are compounds wherein methyl (-Me) side chains at one or more positions of the 
macrocyclic lactone ring are replaced by substituents independently selected from the group 
consisting of (a) -H; (b) ethyl group (-Et); (c) hydroxyl group (-OH) and (d) allyl group (-A1). In 
a particularly preferred embodiment, a method is provided for the genetic modification of 
erythromycin-producing microorganisms which enables them to produce the novel compounds 
12-desmethyl-12-deoxyerythromycin A, 10-desmethylerythromycin A, 
10-desmethyl-12-deoxyerythromycin A, or 6-desmethyl-6-ethylerythromycin A. The compounds 
12-desmethyl-12-deoxyerythromycin A, 10-desmethylerythromycin A, 
10-desmethyl-12-deoxyerythromycin A, and 
6-desmethyl-6-ethylerythromycin A are represented by the structural formulae: 




10-desmethyl-12-deoxyerythromycin A (III)    6-desmethyl-6-ethylerythromycin A (IV) 

The general scheme for producing such polyketides is outlined in FIG. 4a and FIG. 4b. In the 
preferred embodiment, heterologous DNA fragments encoding related AT domains are introduced 
into the Sac. erythraea chromosome by a two-step method termed gene replacement. 

In the first step of gene replacement, an integration vector is constructed through a multi- 
step cloning approach that places a heterologous gene or fragment thereof between two segments 
of DNA having sequences which are identical to those that immediately border (on each side) the 
resident polynucleotide sequence to be replaced. Construction of such a vector may be achieved 
by any means known to those of ordinary skill in the art. For example, nucleotide sequences which 
flank the gene to be replaced can be generated by PCR amplification using chromosomal DNA as 
template and primers which hybridize to the chromosomal sequences immediately upstream and 
downstream of the flanking sequences of interest. The length of the flanking sequences is not 
critical to the practice of the invention but preferably is about 20-5000 base pairs (bp), more 
preferably about 100-5000 bp, and even more preferably about 500-5000 bp. A most preferred 
length of flanking sequence is about 750-1500 bp. Primers used for such amplifications may also 
comprise convenient restriction sites to facilitate cloning of the amplified sequences into suitable 
preparative vectors, to facilitate insertion of the heterologous sequence of interest between the 
flanking sequences and/or to facilitate subcloning of the entire group of sequences (5'-flanking 
region/heterologous polynucleotide sequence of interest/flanking region-3') into suitable vectors 
for integration. The desired heterologous polynucleotide sequences may be generated in a like 
manner. 
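
The nested preferred ranges for flanking sequence length stated above can be expressed as a simple lookup. In this sketch the ranges, which the text qualifies with "about", are treated as exact boundaries purely for simplicity; the function name and return strings are hypothetical.

```python
def flank_preference(length_bp):
    """Classify a flanking-sequence length against the ranges in the text."""
    if length_bp < 0:
        raise ValueError("length must be non-negative")
    # Check the narrowest range first, then progressively wider ones.
    if 750 <= length_bp <= 1500:
        return "most preferred"
    if 500 <= length_bp <= 5000:
        return "even more preferred"
    if 100 <= length_bp <= 5000:
        return "more preferred"
    if 20 <= length_bp <= 5000:
        return "preferred"
    return "outside the stated preferred ranges"


for bp in (10, 50, 300, 600, 1000, 3000):
    print(bp, flank_preference(bp))
```

Because the text states that flank length is not critical, a result outside the stated ranges does not by itself mean a construct is unusable.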

The integration vectors are constructed to also comprise a fragment of DNA containing at 
least one origin of replication that is functional in an intermediate host but is non-functional or 
poorly functional in the production host. The vectors further comprise one or more fragments of 
DNA conferring resistance to an antibiotic, of which at least one functions in the intermediate host 
and at least one functions in the production host. Preferred integration vectors comprise the ColE1 
and pIJ101 origins of replication, as found in plasmid pCS5 (J. Vara et al., J. Bacteriol. 171:5872- 
5881 (1989)). A particularly preferred vector carries a DNA fragment conferring resistance to 
thiostrepton and ampicillin. However, those skilled in the art understand that the particular 
antibiotic resistance genes and origins of replication identified above are necessary only inasmuch 
as they allow for the generation and selection of the desired recombinant plasmids and host cells. 
Other markers and origins of replication may also be used in the practice of the invention. 

When the resident domains of a PKS are functional components of large multifunctional 
polypeptides, care must be taken in the construction of the integration plasmid so that the 
heterologous DNA fragment encoding the heterologous AT domain is positioned in the correct 
orientation and reading frame to its flanking DNA segments so that upon translation from the 
beginning of the coding sequence, an enzymatically functional protein is produced. The correct 
positioning becomes immediately apparent from knowledge of the nucleotide sequences of the 
host PKS genes and the heterologous genes used for gene replacement. 
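
The reading-frame requirement described above lends itself to a simple sequence-level check. The sketch below is illustrative only: it assumes that the upstream segment begins at the start codon of the open reading frame, that the downstream segment ends with the natural stop codon, and it uses short toy sequences. It checks frame and premature stop codons but says nothing about whether the resulting protein is enzymatically functional. All names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def frame_preserved(resident, insert):
    """Downstream frame is kept if the length change is a multiple of 3."""
    return (len(insert) - len(resident)) % 3 == 0


def premature_stops(orf):
    """Indices of in-frame stop codons other than the final codon."""
    orf = orf.upper()
    codons = [orf[i:i + 3] for i in range(0, len(orf) - 2, 3)]
    return [i for i, codon in enumerate(codons[:-1]) if codon in STOP_CODONS]


def check_replacement(upstream, resident, insert, downstream):
    """True if the replaced ORF stays in frame with no premature stop."""
    new_orf = upstream + insert + downstream
    return frame_preserved(resident, insert) and not premature_stops(new_orf)


# Toy sequences (hypothetical data).
up, res, down = "ATGGCC", "GGGAAA", "CCCTGA"
print(check_replacement(up, res, "GGTGCTAAG", down))   # in frame, no stop
print(check_replacement(up, res, "GGTGC", down))       # frameshift
print(check_replacement(up, res, "GGTTAAAAG", down))   # premature stop
```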

In the second step, each of the integration vectors carrying a related gene or fragment 
thereof is independently introduced into a host strain and recombination between each of the 
genomic fragments in the integration plasmid and its corresponding homologous fragment in the 
host strain chromosome is allowed to occur. This procedure results in the exchange of the resident 
AT-encoding DNA in the chromosome for its heterologous counterpart. The general scheme for 
gene replacement by homologous recombination is outlined in FIG. 5. Procedures to introduce 
DNA into polyketide-producing microorganisms and to facilitate homologous recombination are 
described herein. However, those skilled in the art understand that alternative procedures for 
introducing DNA into a polyketide-producing microorganism, such as electroporation, 
transduction, or conjugation, are well known and may also be used in the practice of the invention. 
Procedures for cultivating polyketide-producing microorganisms, as well as methods to recover 
novel polyketides produced from modified strains, to purify such compounds and to confirm the 
identity of those compounds (such as by mass spectrometry or NMR) are well-known to those of 
ordinary skill in the art. 
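
The net outcome of the two recombination events described above can be pictured as a string operation: whatever lies between the two flanking sequences in the chromosome is exchanged for whatever lies between the same flanking sequences in the integration vector. The sketch below models only that end result on toy sequences; it does not model the intermediate single-crossover integrant, selection, or recombination frequencies, and the function name is hypothetical.

```python
def gene_replacement(chromosome, left_flank, insert, right_flank):
    """Return the chromosome after exchange of the segment between the flanks."""
    start = chromosome.find(left_flank)
    if start == -1:
        raise ValueError("left flank not found in chromosome")
    left_end = start + len(left_flank)
    right_start = chromosome.find(right_flank, left_end)
    if right_start == -1:
        raise ValueError("right flank not found downstream of left flank")
    # Keep everything up to the end of the left flank, add the heterologous
    # insert, then resume at the start of the right flank.
    return chromosome[:left_end] + insert + chromosome[right_start:]


# Toy sequences (hypothetical data).
left, resident, right = "ACGTAC", "GGGGGG", "TTGCAA"
chromosome = "CC" + left + resident + right + "AA"
replaced = gene_replacement(chromosome, left, "CATCAT", right)
print(chromosome)
print(replaced)
```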

Although the present invention is described in the Examples that follow in terms of 
preferred embodiments, they are not to be regarded as limiting the scope of the invention. The 
descriptions that follow serve to illustrate the principles and methodologies involved in creating 
novel derivatives of erythromycin. Whereas the examples below describe the replacement of the 
Sac. erythraea AT1, AT2, and AT4-encoding DNA fragments with a heterologous DNA fragment 
which encodes either an AT domain that specifies incorporation of malonate (malonate-AT) or an 
AT domain that specifies incorporation of ethylmalonate (ethylmalonate-AT), those skilled in the 
art understand that one or more fragments of heterologous DNA encoding malonate, 
ethylmalonate, allylmalonate, and/or hydroxymalonate (tartronate)-AT domains can be used to 
replace the other AT-encoding DNA fragments of the erythromycin PKS in Sac. erythraea to 
result in the production of other novel erythromycin derivatives. For example, novel 
erythromycins produced when resident AT-encoding DNA fragments in the erythromycin PKS 
(eryPKS) are independently replaced with heterologous DNA fragments specifying malonate 
and/or ethylmalonate as the extender unit are shown in Table 1. 

In particular, those skilled in the art understand that following the methods described 
herein for replacement of a single resident AT-encoding DNA fragment in the eryPKS, 
replacements of two resident AT-encoding DNA fragments with heterologous DNA fragments 
(specifying malonate, ethylmalonate, allylmalonate, and/or hydroxymalonate-AT domains) in 
stepwise fashion are also possible and result in the formation of novel disubstituted erythromycins. 
Similarly, trisubstituted erythromycins, tetrasubstituted erythromycins, pentasubstituted 
erythromycins and hexasubstituted erythromycins can also be made by replacement of three, four, 
five and six resident AT-encoding DNA fragments in the eryPKS, respectively, with heterologous 
AT-encoding DNA fragments as described herein. Therefore, all substitutions of AT-encoding 
DNA fragments in the eryPKS with heterologous AT-encoding DNA fragments (yielding all 
varieties of proton, ethyl, allyl, and hydroxyl substituted erythromycin derivatives) are within the 
scope of the present invention. Examples of compounds produced by such replacements include 
but are not limited to those shown in Table 1 below. 
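
The combinatorial scope of the malonate and ethylmalonate replacements described above can be enumerated programmatically. The sketch below is an illustration under stated assumptions: it considers only the six methyl-bearing positions 2, 4, 6, 8, 10 and 12, only -H and ethyl as replacement substituents (allyl and hydroxyl are omitted), and it follows the naming pattern used in this description (for example, 6-desmethyl-6-ethylerythromycin A). The function names are hypothetical.

```python
from itertools import combinations

POSITIONS = (2, 4, 6, 8, 10, 12)
PREFIX = {1: "", 2: "di", 3: "tri", 4: "tetra", 5: "penta", 6: "hexa"}


def derivative_name(desmethyl, ethyl=()):
    """Name a derivative from its desmethyl positions and ethyl positions."""
    desmethyl, ethyl = sorted(desmethyl), sorted(ethyl)
    if not desmethyl or not set(desmethyl) <= set(POSITIONS):
        raise ValueError("desmethyl positions must be drawn from POSITIONS")
    if not set(ethyl) <= set(desmethyl):
        raise ValueError("ethyl can only replace a removed methyl group")
    name = ",".join(map(str, desmethyl)) + "-" + PREFIX[len(desmethyl)] + "desmethyl"
    if ethyl:
        name += "-" + ",".join(map(str, ethyl)) + "-" + PREFIX[len(ethyl)] + "ethyl"
    return name + "erythromycin A"


def all_derivatives():
    """Every non-empty set of replaced positions, each bearing -H or ethyl."""
    names = []
    for k in range(1, len(POSITIONS) + 1):
        for des in combinations(POSITIONS, k):
            for j in range(k + 1):
                for eth in combinations(des, j):
                    names.append(derivative_name(des, eth))
    return names


names = all_derivatives()
print(len(names))
print(derivative_name((6,), (6,)))
print(derivative_name((2, 4, 10, 12), (2, 10)))
```

Under these assumptions each of the six positions has three states (methyl, -H or ethyl), so the number of derivatives other than erythromycin A itself is 3^6 - 1.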



Table 1 



Structures from Changes at Side Chain Positions 
2-Tetradesmethyl-10,12-diethylerythromycin A 
2-Tetradesmethyl-2,8,12-triethylerythromycin A 
2-Tetradesmethyl-8,12-diethylerythromycin A 
2-Tetradesmethyl-4,8,12-triethylerythromycin A 
2-Tetradesmethyl-8,12-diethylerythromycin A 
2-Tetradesmethyl-6,8,12-triethylerythromycin A 
2-Tetradesmethyl-8,12-diethylerythromycin A 
2-Tetradesmethyl-2,8,10,12-tetraethylerythromycin A 
2-Tetradesmethyl-8,10,12-triethylerythromycin A 
2-Tetradesmethyl-4,8,10,12-tetraethylerythromycin A 
2-Tetradesmethyl-8,10,12-triethylerythromycin A 
2-Tetradesmethyl-6,8,10,12-tetraethylerythromycin A 
2-Tetradesmethyl-8,10,12-triethylerythromycin A 
2-Tetradesmethyl-2-ethylerythromycin A 
2-Tetradesmethylerythromycin A 
2-Tetradesmethyl-4-ethylerythromycin A 
2-Tetradesmethylerythromycin A 
2-Tetradesmethyl-2,10-diethylerythromycin A 
2-Tetradesmethyl-10-ethylerythromycin A 
2-Tetradesmethyl-4,10-diethylerythromycin A 
2-Tetradesmethyl-10-ethylerythromycin A 
2-Tetradesmethyl-2,6-diethylerythromycin A 
2-Tetradesmethyl-6-ethylerythromycin A 
2-Tetradesmethyl-4,6-diethylerythromycin A 
2-Tetradesmethyl-6-ethylerythromycin A 
2-Tetradesmethyl-2,6,10-triethylerythromycin A 
2-Tetradesmethyl-6,10-diethylerythromycin A 
2-Tetradesmethyl-4,6,10-triethylerythromycin A 
2-Tetradesmethyl-6,10-diethylerythromycin A 
2-Tetradesmethyl-2,12-diethylerythromycin A 
2-Tetradesmethyl-12-ethylerythromycin A 
2-Tetradesmethyl-4,12-diethylerythromycin A 
2-Tetradesmethyl-12-ethylerythromycin A 
2-Tetradesmethyl-2,10,12-triethylerythromycin A 
2-Tetradesmethyl-10,12-diethylerythromycin A 
2-Tetradesmethyl-4,10,12-triethylerythromycin A 
2-Tetradesmethyl-10,12-diethylerythromycin A 
2-Tetradesmethyl-2,6,12-triethylerythromycin A 
2-Tetradesmethyl-6,12-diethylerythromycin A 
2-Tetradesmethyl-6,12-diethylerythromycin A 
2-Tetradesmethyl-2,6,10,12-tetraethylerythromycin A 
2-Tetradesmethyl-6,10,12-triethylerythromycin A 
2,4,10,12-Tetradesmethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,10-diethylerythromycin A 
2,4,10,12-Tetradesmethyl-10-ethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,4-diethylerythromycin A 
2,4,10,12-Tetradesmethyl-4-ethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,4,10-triethylerythromycin A 
2,4,10,12-Tetradesmethyl-4,10-diethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,12-diethylerythromycin A 
2,4,10,12-Tetradesmethyl-12-ethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,10,12-triethylerythromycin A 
2,4,10,12-Tetradesmethyl-10,12-diethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,4,12-triethylerythromycin A 
2,4,10,12-Tetradesmethyl-4,12-diethylerythromycin A 
2,4,10,12-Tetradesmethyl-2,4,10,12-tetraethylerythromycin A 
2,4,10,12-Tetradesmethyl-4,10,12-triethylerythromycin A 
2,6,8,12-Tetradesmethyl-2-ethylerythromycin A 
2,6,8,12-Tetradesmethylerythromycin A 
4,6,8,12-Tetradesmethyl-4-ethylerythromycin A 
4,6,8,12-Tetradesmethylerythromycin A 
2,6,8,12-Tetradesmethyl-2,8-diethylerythromycin A 
2,6,8,12-Tetradesmethyl-8-ethylerythromycin A 
4,6,8,12-Tetradesmethyl-4,8-diethylerythromycin A 
4,6,8,12-Tetradesmethyl-8-ethylerythromycin A 
2,6,8,12-Tetradesmethyl-2,6-diethylerythromycin A 
2,6,8,12-Tetradesmethyl-6-ethylerythromycin A 
4,6,8,12-Tetradesmethyl-4,6-diethylerythromycin A 
4,6,8,12-Tetradesmethyl-6-ethylerythromycin A 
2,6,8,12-Tetradesmethyl-2,6,8-triethylerythromycin A 
2,6,8,12-Tetradesmethyl-6,8-diethylerythromycin A 
4,6,8,12-Tetradesmethyl-4,6,8-triethylerythromycin A 
4,6,8,12-Tetradesmethyl-6,8-diethylerythromycin A 
2,6,8,12-Tetradesmethyl-2,12-diethylerythromycin A 
2,6,8,12-Tetradesmethyl-12-ethylerythromycin A 
4,6,8,12-Tetradesmethyl-4,12-diethylerythromycin A 
4,6,8,12-Tetradesmethyl-12-ethylerythromycin A 
2,6,8,12-Tetradesmethyl-2,8,12-triethylerythromycin A 
2-Pentadesmethyl-2,6-diethylerythromycin A 
2-Pentadesmethyl-2,4-diethylerythromycin A 
2-Pentadesmethyl-6,8,12-triethylerythromycin A 
2-Pentadesmethyl-4,8,12-triethylerythromycin A 
2-Pentadesmethyl-2,8,12-triethylerythromycin A 
2-Pentadesmethyl-4,6,12-triethylerythromycin A 
2-Pentadesmethyl-2,6,12-triethylerythromycin A 
2-Pentadesmethyl-4,6,8-triethylerythromycin A 
2-Pentadesmethyl-2,6,8-triethylerythromycin A 
2-Pentadesmethyl-2,4,8-triethylerythromycin A 
2-Pentadesmethyl-2,4,6-triethylerythromycin A 
2-Pentadesmethyl-4,6,8-triethylerythromycin A 
2-Pentadesmethyl-2,6,8,12-tetraethylerythromycin A 
2-Pentadesmethyl-2,4,8,12-tetraethylerythromycin A 
2-Pentadesmethyl-2,4,6,12-tetraethylerythromycin A 
2-Pentadesmethyl-2,4,6,8-tetraethylerythromycin A 
2-Pentadesmethyl-2,4,6,8,12-pentaethylerythromycin 

0-Pentadesmethylerythromycin A 
0-Pentadesmethyl-10-ethylerythromycin A 
0-Pentadesmethyl-8-ethylerythromycin A 
0-Pentadesmethyl-6-ethylerythromycin A 
0-Pentadesmethyl-4-ethylerythromycin A 
0-Pentadesmethyl-2-ethylerythromycin A 
0-Pentadesmethyl-8,10-diethylerythromycin A 
0-Pentadesmethyl-6,10-diethylerythromycin A 
0-Pentadesmethyl-4,10-diethylerythromycin A 
0-Pentadesmethyl-2,10-diethylerythromycin A 
0-Pentadesmethyl-6,8-diethylerythromycin A 
0-Pentadesmethyl-4,8-diethylerythromycin A 
0-Pentadesmethyl-2,8-diethylerythromycin A 
0-Pentadesmethyl-4,6-diethylerythromycin A 
0-Pentadesmethyl-2,6-diethylerythromycin A 
0-Pentadesmethyl-2,4-diethylerythromycin A 
0-Pentadesmethyl-6,8,10-triethylerythromycin A 
0-Pentadesmethyl-4,8,10-triethylerythromycin A 
0-Pentadesmethyl-2,8,10-triethylerythromycin A 
0-Pentadesmethyl-4,6,10-triethylerythromycin A 
0-Pentadesmethyl-2,6,10-triethylerythromycin A 
0-Pentadesmethyl-2,4,10-triethylerythromycin A 
0-Pentadesmethyl-4,6,8-triethylerythromycin A 
0-Pentadesmethyl-2,6,8-triethylerythromycin A 
0-Pentadesmethyl-2,4,8-triethylerythromycin A 
0-Pentadesmethyl-2,4,6-triethylerythromycin A 
0-Pentadesmethyl-4,6,8,10-tetraethylerythromycin A 
0-Pentadesmethyl-2,6,8,10-tetraethylerythromycin A 
10-Pentadesmethyl-2,4,8,10-tetraethylerythromycin A 
10-Pentadesmethyl-2,4,6,10-tetraethylerythromycin A 
10-Pentadesmethyl-2,4,6,8-tetraethylerythromycin A 
10-Pentadesmethyl-2,4,6,8,10-pentaethylerythromycin 



10,12-Hexadesmethylerythromycin A 
10,12-Hexadesmethyl-12-ethylerythromycin A 
10,12-Hexadesmethyl-10-ethylerythromycin A 
10,12-Hexadesmethyl-8-ethylerythromycin A 
10,12-Hexadesmethyl-6-ethylerythromycin A 
10,12-Hexadesmethyl-4-ethylerythromycin A 
10,12-Hexadesmethyl-2-ethylerythromycin A 
10,12-Hexadesmethyl-10,12-diethylerythromycin A 
10,12-Hexadesmethyl-8,12-diethylerythromycin A 
10,12-Hexadesmethyl-6,12-diethylerythromycin A 
10,12-Hexadesmethyl-4,12-diethylerythromycin A 
10,12-Hexadesmethyl-2,12-diethylerythromycin A 
10,12-Hexadesmethyl-8,10-diethylerythromycin A 
10,12-Hexadesmethyl-6,10-diethylerythromycin A 
10,12-Hexadesmethyl-4,10-diethylerythromycin A 
10,12-Hexadesmethyl-2,10-diethylerythromycin A 
10,12-Hexadesmethyl-6,8-diethylerythromycin A 
10,12-Hexadesmethyl-4,8-diethylerythromycin A 
10,12-Hexadesmethyl-2,8-diethylerythromycin A 
10,12-Hexadesmethyl-4,6-diethylerythromycin A 
10,12-Hexadesmethyl-2,6-diethylerythromycin A 
10,12-Hexadesmethyl-2,4-diethylerythromycin A 
10,12-Hexadesmethyl-8,10,12-triethylerythromycin A 
10,12-Hexadesmethyl-6,10,12-triethylerythromycin A 
10,12-Hexadesmethyl-4,10,12-triethylerythromycin A 
10,12-Hexadesmethyl-2,10,12-triethylerythromycin A 
10,12-Hexadesmethyl-6,8,12-triethylerythromycin A 
10,12-Hexadesmethyl-4,8,12-triethylerythromycin A 
10,12-Hexadesmethyl-2,8,12-triethylerythromycin A 
10,12-Hexadesmethyl-4,6,12-triethylerythromycin A 
10,12-Hexadesmethyl-2,6,12-triethylerythromycin A 
10,12-Hexadesmethyl-2,4,12-triethylerythromycin A 
10,12-Hexadesmethyl-6,8,10-triethylerythromycin A 
10,12-Hexadesmethyl-4,8,10-triethylerythromycin A 
10,12-Hexadesmethyl-2,8,10-triethylerythromycin A 
10,12-Hexadesmethyl-4,6,10-triethylerythromycin A 
10,12-Hexadesmethyl-2,6,10-triethylerythromycin A 
10,12-Hexadesmethyl-2,4,10-triethylerythromycin A 
1 0, 1 2-Hexadesmethyl-4,6,8-triethylerythromycin A 
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2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6,8-triethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,8-triethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6-triethylerythromycin A 
2,4,6,8, 1 0, 1 2-Hexadesmethyl-6,8, 1 0, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,8, 1 0, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,8, 1 0, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,6, 1 0, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6, 1 0, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4, 1 0, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,6,8, 1 2-tetraethylerythromycin 
A 

2,4,6,8,10,12-Hexadesmethyl-2,6,8,12-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,8, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6, 1 2-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,6,8, 1 0-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6,8, 1 0-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,8, 1 0-tetraethylerythromycin 
A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6, 1 0-tetraethylerythromycin 
A 

2,4,6,8 , 1 0, 1 2-Hexadesmethyl-2,4,6,8-tetraethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-4,6,8, 10,12- 

pentaethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,6,8, 1 0, 1 2- 

pentaethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,8, 1 0, 1 2- 

pentaethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6, 10,12- 

pentaethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6„8, 1 2- 

pentaethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6,8, 1 0- 

pentaethylerythromycin A 

2,4,6,8, 1 0, 1 2-Hexadesmethyl-2,4,6,8, 10,12- 

hexaethylerythromycin A 
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Although in the Examples that follow the AT-encoding DNA fragments from S. 
hygroscopicus ATCC 29253, S. venezuelae ATCC 15439, and S. caelestis NRRL-2821 were used 
to replace resident AT-encoding DNA fragments in the eryPKS to yield desmethyl, 
desmethylethyl, and desmethylhydroxyerythromycins, it is understood that many malonate, 
ethylmalonate, and hydroxymalonate AT-encoding DNA fragments can be used in place of or in 
addition to the heterologous malonate, ethylmalonate, and hydroxymalonate-AT DNA fragments 
described herein to produce the same desmethyl, desmethylethyl, and 
desmethylhydroxyerythromycin compounds. Examples of DNA fragments encoding 
malonate-AT domains that can be used in place of or in addition to those specifically described in the 
Examples below include but are not limited to the DNA fragments encoding AT domains from 
modules 2, 5, 8, 9, 11, or 12 of the rapamycin PKS genes from S. hygroscopicus, the AT domain 
from module 2 of the PKS responsible for the synthesis of methymycin or pikromycin by S. 
venezuelae, the AT domains from modules 3 and 7 of the PKS responsible for the synthesis of 
tylosin by S. fradiae, or the AT domains from modules 1, 2, 3 and 7 of the PKS responsible for the 
synthesis of spiramycin by S. ambofaciens. Examples of DNA fragments encoding 
ethylmalonate-AT domains that can be used in place of or in addition to those specifically 
described in the Examples below include but are not limited to the DNA fragments encoding the 
AT domain from module 5 of the spiramycin PKS genes from S. ambofaciens, the AT domain 
from module 5 of the tylosin PKS genes from S. fradiae, and the AT domain from module 5 of the 
maridomycin PKS genes of S. hygroscopicus. Examples of DNA fragments encoding 
hydroxymalonate-AT domains that can be used in place of or in addition to those specifically 
described in the Examples below include but are not limited to the DNA fragments encoding the 
AT domain from module 6 of the spiramycin PKS genes from S. ambofaciens, and the AT domain 
from module 6 of the leucomycin PKS genes from Streptoverticillium kitasatoensis. Thus the use 
of any and all DNA fragments encoding malonate, ethylmalonate, and hydroxymalonate-ATs to 
replace any of the resident DNA fragments encoding methylmalonate-ATs in the eryPKS genes to 
result in the production of novel derivatives of erythromycin is considered within the scope of 
the present invention. 

Furthermore, those of ordinary skill understand that following the methods described 
herein for replacement of resident AT-encoding DNA fragments in the eryPKS, the DNA 
fragments encoding malonate-ATs in S. hygroscopicus, S. venezuelae, or S. caelestis, and 
ethylmalonate or hydroxymalonate-ATs in S. caelestis may be replaced with those AT-encoding 
DNA fragments from the eryPKS which utilize methylmalonyl CoA as a substrate. As with the 
eryPKS, all combinations are contemplated, leading to the production of, for example, 13- 
methylrapamycin, 15-methylrapamycin, 33-methylrapamycin, 13,15-dimethylrapamycin, 
13,15,33-trimethylrapamycin, and 10-methylpikromycin. 

The methods of the present invention are widely applicable to all erythromycin-producing 
microorganisms, of which a non-exhaustive list includes Saccharopolyspora species, 
Streptomyces griseoplanus, Nocardia sp., Micromonospora sp., Arthrobacter sp. and 
Streptomyces antibioticus. Of these, Sac. erythraea is the most preferred. Other hosts, which 
normally do not produce erythromycin but into which the erythromycin biosynthesis genes can be 
introduced by cloning, can also be employed. Such strains include but are not limited to 
Streptomyces coelicolor and Streptomyces lividans or Bacillus subtilis, as examples. In each of 
the other erythromycin-producing strains, replacement of the resident AT domains in the 
erythromycin PKS is conducted by double homologous recombination using cloned eryPKS 
sequences on both sides of the AT domain to be replaced to effect the switching of the resident AT 
with a heterologous AT as illustrated in the Examples that follow. 

Many other variations of the methods that are illustrated in the Examples that follow will 
occur to those skilled in the art. For example, whereas the plasmids pUC18, pUC19, pGEM3Zf, 
and pCS5 were employed in the present invention for the cloning of the LigAT2, venAT, 
rapAT14, NidAT5, or NidAT6-encoding DNA fragments and construction of the integration 
vectors, other plasmids, phage, or phagemids including but not limited to pBR322, pACYC184, 
M13mp18, M13mp19, pGEM7Zf and the like can be used in their place to allow the same 
constructions to be made. Furthermore, many alternative strategies can be followed for the 
cloning of the heterologous AT-encoding DNA fragments into integration vectors that enable 
homologous recombination to occur in corresponding regions of the eryPKS. Examples of 
alternative strategies include the use of longer or shorter fragments of DNA corresponding to 
either the AT domains or the flanking sequences, using different restriction sites for the cloning of 
the AT domains or the adjacent flanking sequences, or changing the sequence of a resident AT- 
encoding DNA fragment so that it expresses a domain which recognizes malonyl CoA as a 
substrate rather than methylmalonyl CoA. All such variations are within the scope of the present 
invention. Similarly, the use of alternative strategies to introduce DNA into Sac. erythraea or 
other erythromycin-producing hosts for the purpose of effecting gene exchange to result in the 
production of novel erythromycins, such as conjugation, transduction or electroporation, is also 
included within the scope of the present invention. 

Those skilled in the art also understand that erythromycins B, C and D are naturally 
occurring forms of erythromycin and therefore would be produced as novel derivatives in Sac. 
erythraea by the modifications disclosed herein. Production of these forms may be further 
enhanced by inactivation of eryK (Stassi, D. et al. J. Bacteriology 175:182-189 (1993)) to yield 
erythromycin B derivatives, eryG (S.F. Haydock et al. Mol. Gen. Genet. 230:120-128 (1991)) to 
yield erythromycin C derivatives and eryK and eryG to yield erythromycin D derivatives. 
Furthermore, in Sac. erythraea, 6-deoxy forms of the novel erythromycins A, B, C and D can be 
generated by inactivation of eryF (J.M. Weber et al. Science 252:114-117 (1991)) (in addition to 
those specified above), which encodes the hydroxylase responsible for hydroxylating the C-6 
position. In addition, conversion of 6-deoxy forms of the novel erythromycins A, B, C and D to 
their corresponding erythromycin A, B, C and D derivatives may be accomplished by cloning 
additional copies or by employing other means of overexpression of the eryF gene in the 
production host. Similarly, conversion of novel forms of erythromycins B, C and D to novel 
forms of erythromycin A may be achieved by expressing or overexpressing eryK and/or eryG in 
the production host. The methodologies for generating erythromycins B, C and D and 6- 
deoxyerythromycins A, B, C and D are well known to those of ordinary skill in the art. 

Those skilled in the art also understand that erythronolide B and 3-α-mycarosylerythronolide B 
are naturally occurring intermediates in the biosynthesis of 
erythromycin and therefore would be produced as novel intermediates in Sac. erythraea by the 
modifications disclosed herein. Production of these forms may be further enhanced by 
inactivation of any of the eryB genes to yield erythronolide B or eryC genes to yield 
3-α-L-mycarosylerythronolide B (Weber et al. J. Bacteriol. 172:2372-2383 (1990) and Haydock et al. 
Mol. Gen. Genet. 230:120-128 (1991)). Furthermore, 6-deoxy forms of these novel intermediates 
can be generated by inactivation of eryF as described above. The methodologies for generating 
erythronolide B and 3-α-mycarosylerythronolide B, as well as their 6-deoxy derivatives, are well 
known to those of ordinary skill in the art. 






Bacterial Strains, Plasmid Vectors, and Growth Media 



The erythromycin-producing microorganism used to practice the following examples of 
the invention was Sac. erythraea ER720 (J.P. DeWitt, J. Bacteriol. 164:969 (1985)). The host 
strain for the growth of E. coli derived plasmids was DH5α (GIBCO BRL, Gaithersburg, 
MD). The S. hygroscopicus strain that carries the Lig-PKS cluster is available from the American 
Type Culture Collection, Bethesda, MD under the accession number ATCC 29253. The S. 
venezuelae strain that carries the venAT domain described herein is available from the American 
Type Culture Collection, Bethesda, MD under the accession number ATCC 15439. 

E. coli bacteria carrying pUC18/venAT have been deposited at the Agricultural Research 
Culture Collection (NRRL), 1815 N. University Street, Peoria, Illinois 61604 U.S.A., as of 
December 23, 1996, under the terms of the Budapest Treaty and will be maintained for a period of 
thirty (30) years from the date of deposit, or for five (5) years after the last request for the deposit, 
or for the enforceable period of the U.S. patent, whichever is longer. The deposit and any other 
deposited material described herein are provided for convenience only, and are not required to 
practice the present invention in view of the teachings provided herein. The DNA sequence in all 
of the deposited material is incorporated herein by reference. E. coli bacteria carrying 
pUC18/venAT were accorded NRRL Deposit No. B-21652. 

Plasmids pUC18 and pUC19 can be obtained from GIBCO BRL. Plasmid pCS5, a 
multifunctional vector for integrative transformation of Sac. erythraea, is described in Vara, et al., 
J. Bacteriology, 171:5872-5891 (1989) and is referred to therein as pWHM3. Cosmid pNJ1 is 
described in Tuan, et al., Gene, 90:21-29 (1990). 

Sac. erythraea was grown for protoplast formation and routine liquid culture in 50 mL of 
SGGP medium (Yamamoto, et al., J. Antibiotic. 39:1304 (1986)), supplemented with 10 µg of 
thiostrepton/mL for plasmid selection where appropriate. 

Reagents and General Methods 






Commercially available reagents were used to make compounds, plasmids and genetic 
variants of the present invention, including butyric acid, ampicillin, thiostrepton, restriction 
endonucleases, T4-DNA ligase, and calf intestine alkaline phosphatase. The nucleotide sequence 
of the eryA genes from Sac. erythraea has been deposited in the GenBank database under the 
accession numbers M63676 and M63677 and is publicly available. 

Standard molecular biology procedures (Maniatis et al. supra) were used for the 
construction and characterization of replacement plasmids. Plasmid DNA was routinely isolated 
by the alkaline lysis method (H. C. Birnboim and J. Doly, 1979, Nucleic Acids Res. 7:1513) or 
with the QIAprep Spin Plasmid kit (Qiagen, Inc., Chatsworth, CA) according to the manufacturer's 
instructions. Restriction fragments were recovered from 0.8-1% agarose gels with Prep-A-Gene 
(BioRad). The products of ligation for each step of the plasmid constructions were used to 
transform the intermediate host, E. coli DH5α (GIBCO BRL), which was cultured in the 
presence of ampicillin to select for host cells carrying recombinant plasmids. Selection for insert 
DNA with X-gal was used where appropriate. Typically, LB plates contained 30 mL of LB agar 
(Maniatis et al. supra). Plasmid DNAs were isolated from individual transformants that had 
been grown in liquid culture and characterized with respect to known restriction sites. DNA 
sequence determination was by cycle sequencing (fmol DNA Sequencing System, Promega 
Corp., Madison, WI) according to the manufacturer's instructions. 

SCM medium consists of 20 g Soytone, 15 g Soluble Starch, 10.5 g MOPS, 1.5 g Yeast 
Extract and 0.1 g CaCl2 per liter of distilled H2O. SGGP medium is described in Yamamoto, et 
al., 1986, J. Antibiotic. 39:1304. PM buffer (per liter) is 200 g sucrose, 0.25 g K2SO4 in 890 mL 
H2O, with the addition after sterilization of 100 mL 0.25 M TES, pH 7.2, 2 mL trace elements 
solution (Hopwood, et al., 1985, Genetic Manipulation of Streptomyces A Laboratory Manual, 
The John Innes Foundation), 0.08 mL 2.5 M CaCl2, 10 mL 0.5% KH2PO4, 2 mL 2.5 M MgCl2. 
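
The per-liter SCM recipe above can be scaled to any batch volume, for instance the thirty liters prepared for the fermentor in Example 5. The following is only an illustrative sketch: the dictionary and function names are hypothetical and are not part of the described methods.

```python
# Per-liter SCM recipe as stated in the text (grams per liter of distilled water).
SCM_PER_LITER_G = {
    "Soytone": 20.0,
    "Soluble Starch": 15.0,
    "MOPS": 10.5,
    "Yeast Extract": 1.5,
    "CaCl2": 0.1,
}


def scale_recipe(per_liter_g, volume_l):
    """Return the grams of each component needed for volume_l liters."""
    if volume_l <= 0:
        raise ValueError("volume must be positive")
    # Round to milligram precision to hide floating-point noise.
    return {name: round(grams * volume_l, 3) for name, grams in per_liter_g.items()}


if __name__ == "__main__":
    for name, grams in scale_recipe(SCM_PER_LITER_G, 30).items():
        print(f"{name}: {grams} g")
```

The same helper applies to any other per-liter recipe expressed as a name-to-grams mapping.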

Integrative transformation of Sac. erythraea protoplasts, and routine growth and 
sporulation were carried out according to procedures described in Donadio, et al., 1991, Science 
115:97; Weber and Losick, 1988, Gene 68:173; and Yamamoto, et al., 1986, J. Antibiotic. 
39:1304. 

Oligo primers used in the PCR amplifications and described in the Examples below are 
as follows: 



5'-ATCTACACSTCSGGCACSACSGGCAAGCCSAAGGG-3'                SEQ ID NO: 3 
5'-CTSAAGGCSGGCGGCGCSTACGTSCCSATCGACCC-3'                SEQ ID NO: 4 
5'-CGCGAATTCCTAGGCTGGCGGTGATGTTCA-3'                     SEQ ID NO: 5 
5'-GCCGGATCCATGCATACGTCGGCAGGGAGGTAC-3'                  SEQ ID NO: 6 
5'-GCTCGAATTCGCTGGTCGCGGTGCACCT-3'                       SEQ ID NO: 7 
5'-GACGGATCCGGCCCTAGGCTGCGCCCGGCTCG-3'                   SEQ ID NO: 8 
5'-TTGGGATCCTATGCATTCCAGCGCGAGCGC-3'                     SEQ ID NO: 9 
5'-GAGAAGCTTGGCGCGACTTGCCCGCT-3'                         SEQ ID NO: 10 
5'-TTTTTTAAGCTTGGTACCTGCTCACCGGCAACACCG-3'               SEQ ID NO: 11 
5'-TTTTTTGGATCCCTGCAGCCTAGGGTCGGAGGCACTGCCGGT-3'         SEQ ID NO: 12 
5'-TTTTTTCTGCAGTATGCATTCCAGGGCAAGCGGTTCT-3'              SEQ ID NO: 13 
5'-TTTTTTGAATTCACGCGTTGCCCGCGGCGTAGGCGC-3'               SEQ ID NO: 14 
5'-GATCGAATTCCCTAGGACGGCAGTCCTGCTCACC-3'                 SEQ ID NO: 15 
5'-GATCGGATCCATGCATACGTCGGAAGGTCGACCCG-3'                SEQ ID NO: 16 
5'-TTCGAAGAATTCCCTAGGGTTGCCTTCCTGTTCGAC-3'               SEQ ID NO: 17 
5'-TTCGAAAAGCTTATGCATAGACCGGCAGATCCACCG-3'               SEQ ID NO: 18 
5'-CGGTSAAGTCSAACATCGG-3'                                SEQ ID NO: 19 
5'-GCRATCTCRCCCTGCGARTG-3'                               SEQ ID NO: 20 
5'-GAGAGAGGAACCAACGCGCACGTGATCGTCGAAGAGGCACCAGC-3'       SEQ ID NO: 21 
5'-GAGAGAGGATCCGACCTAGGCGCGGAGGTCACCGGCGCGACGGCG-3'      SEQ ID NO: 22 
5'-GAGAGACCTAGGAAGCCGGTGTTCGTGTTCCCCGGCCAGGGCT-3'        SEQ ID NO: 23 
5'-GAGAGAGGATCCGAGGCCGGCCGTGCGCCCGGACCGAAGACCGCCTC-3'    SEQ ID NO: 24 
5'-GAGAGAATTCCCTAGGGTCGCCTTCGTCTTTCCCGGGCAGG-3'          SEQ ID NO: 25 
5'-TTGAGATCTTATGCATACGAGGGAAGCGGCACCCTGC-3'              SEQ ID NO: 26 



Mass spectrometry was routinely performed with a Finnigan-MAT 7000 mass 
spectrometer equipped with an atmospheric pressure chemical ionization source (APCI). 
Electrospray mass spectrometry (ESI-MS) was performed with a Finnigan-MAT 752-7000 mass 
spectrometer equipped with a Finnigan atmospheric pressure ionization (API) source. HPLC 
separation was carried out on a Hewlett-Packard 1050 liquid chromatograph using a Prodigy ODS 
(2) column (5 µm, 50 x 2 mm) and a gradient elution of 5 mM ammonium acetate and methanol. The 
flow rate was 0.3 mL/min. 

For large scale preparation of erythromycin derivatives, fermentation beers are typically 
adjusted to pH 9 with NH4OH and then extracted two times with an equal volume of CH2Cl2. The 
pooled extract is then concentrated to a wet oil (approx. 1 g per liter of fermentation beer). 
Concentrated extracts are digested in methanol and chromatographed over a column of 
Sephadex® LH-20 (Pharmacia Biotech, Uppsala, Sweden) in the same solvent. Fractions are 
tested for bioactivity against Staphylococcus aureus, and active fractions are combined and 
concentrated. When additional column chromatography is desired to reduce sample weight, the 
concentrated sample is digested in a solvent system consisting of n-heptane, chloroform, ethanol 
(10:10:1, v/v/v) and chromatographed over a column of Sephadex® LH-20 in the same system. 
Fractions are then analyzed by 1H NMR, focusing on the characteristic erythromycin resonances 
around δ = 5.0 (H-13), δ = 4.9 (H-1"), and δ = 4.4 (H-1') (Everett and Tyler, J. Chem. Soc. Perkin 
Trans. 1, pg. 2599 (1985)) and pooled according to purity. Alternatively, column chromatography 
is replaced with an extraction sequence. In this case, the initial pooled CH2Cl2 extract is 
concentrated to approximately 400 mL. This is extracted twice with equal volumes of 0.05 M 
aqueous potassium phosphate with the pH chosen between pH 4.5-6. The aqueous phase is then 
pooled, adjusted to pH 8-9, and extracted twice with equal volumes of ethyl acetate. Finally, the 
ethyl acetate extracts are pooled and concentrated. When additional reduction in sample weight is 
desired, the extraction sequence is repeated on a 10-50 fold smaller scale, typically yielding about 
500 mg of partially pure material. 

High resolution separation of erythromycin derivatives is obtained by one or more rounds 
of countercurrent chromatography (Hostettmann and Marston, Anal. Chim. Acta, 236:63-76 
(1990)). When the weight of the partially pure sample from column chromatography or the 
extraction sequence is less than 5 g, but greater than 0.5 g, it is digested in 7 mL of the upper 
phase of a solvent system (3:7:5, v/v/v) consisting of n-hexane, ethyl acetate, 0.02 M aqueous 
potassium phosphate, with a pH chosen between 6.5-8.0, and chromatographed on a custom 
droplet countercurrent chromatography (DCCC) instrument [100 vertical columns, 0.4 cm dia. x 
24 cm length; Hostettmann and Marston, Anal. Chim. Acta, 236:63-76 (1990)] in the same system 
with the upper phase as the mobile phase. Flow rates of approximately 120-200 mL/hr are 
employed. As before, fractions are analyzed by NMR and bioactivity, and pooled according to 
purity. When the weight of the partially pure sample is approximately 0.5 g or less, countercurrent 
chromatography is carried out on an Ito multi-layered horizontal Coil Planet Centrifuge (P.C. Inc., 
Potomac, MD) using either the system consisting of n-hexane, ethyl acetate, 0.02 M aqueous 
potassium phosphate, with the pH chosen between 6.5-8.0, (3:7:5, v/v/v) employed above, or 
similar systems in which the ratio of hexane to EtOAc and/or the pH are varied. The 
chromatography is developed either isocratically, or with a gradient starting, for example, with the 
upper phase of a solvent system consisting of n-hexane, ethyl acetate, 0.02M aqueous potassium 
phosphate, with the pH chosen between 6.5-8.0, (7:3:5, v/v/v) and finishing with the upper phase 
of a solvent system consisting of n-hexane, ethyl acetate, 0.02 M aqueous potassium phosphate, at 
the same pH, (1:1:1, v/v/v). In all cases, flow rates of approximately 120 mL/hr are employed. 
As before, fractions are analyzed by NMR and bioactivity, and pooled according to purity. Once 
sufficient purity is achieved, 1H and 13C NMR spectra are measured with a General Electric 
GN500 spectrometer and structural assignments are made with the aid of 
correlational spectroscopy (COSY), heteronuclear multiple quantum correlation (HMQC), 
heteronuclear multiple bond correlation (HMBC), and distortionless enhancement by polarization 
transfer (DEPT) experiments. 

The foregoing can be better understood by reference to the following examples, which are 
provided as non-limiting illustrations of the practice of the instant invention. 

EXAMPLE 1: Cloning of the LigAT2 Domain from 
Streptomyces hygroscopicus ATCC 29253 
A genomic library of Streptomyces hygroscopicus ATCC 29253 DNA was constructed in 
the bifunctional cosmid pNJ1 (Tuan, et al., Gene 90:21-29 (1990)) using standard methods of 
recombinant DNA technology. Briefly, cosmid vector was prepared by digesting approximately 5 
µg of pNJ1 with EcoRI, dephosphorylating with calf intestinal alkaline phosphatase (CIAP) and 
then digesting with BglII to generate one arm, and also digesting 5 µg of pNJ1 with HindIII, 
dephosphorylating with CIAP and then digesting with BglII to generate the other. Insert DNA was 
prepared by partially digesting approximately 25 µg of high molecular weight S. hygroscopicus 
chromosomal DNA with SauIIIA according to the procedure outlined in Maniatis, et al. supra. 
SauIIIA fragments of approximately 35 kb were recovered from a 0.5% low melting point agarose 
gel by melting the appropriate gel slice at 65°C, adding 3 volumes of TE buffer, gently extracting 
twice with phenol and once with chloroform, and ethanol precipitating the aqueous phase. For the 
ligation, approximately 3 µg of this chromosomal DNA was mixed with approximately 0.5 µg of 
each cosmid arm and EtOH precipitated. The precipitate was resuspended in 7 µL of water to 
which was added 2 µL of 5X ligation buffer and 1 µL of T4 DNA ligase. The mixture was 
incubated overnight at 16°C. Gigapack II XL (Stratagene®) was used for packaging 2 µL of the 
ligation mix according to the manufacturer's instructions. The host bacterium was E. coli ER1772 
from New England Biolabs (Beverly, MA). Twenty-six colonies were examined by restriction 
analysis and all were found to contain insert DNA. Individual colonies were picked into thirty-four 
96-well plates to give a 99.99% probability that the library represented all S. hygroscopicus 
sequences. Further restriction analysis demonstrated the average insert size to be about 30 kb. 
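
The 99.99% figure quoted above is the kind of estimate usually obtained from the Clarke-Carbon relation, P = 1 - (1 - f)^N, where f is the insert size divided by the genome size and N is the number of clones. The sketch below is an illustration only: the text does not say how the probability was computed, and the 8 Mb genome size used in the example is an assumption, not a value from this document.

```python
import math


def coverage_probability(n_clones, insert_bp, genome_bp):
    """Clarke-Carbon probability that a given sequence is in the library."""
    fraction = insert_bp / genome_bp
    return 1.0 - (1.0 - fraction) ** n_clones


def clones_needed(probability, insert_bp, genome_bp):
    """Smallest number of clones giving at least the requested probability."""
    fraction = insert_bp / genome_bp
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - fraction))


if __name__ == "__main__":
    ASSUMED_GENOME_BP = 8_000_000   # assumption: typical Streptomyces genome size
    INSERT_BP = 30_000              # average insert size reported in the text
    N_CLONES = 34 * 96              # thirty-four 96-well plates

    p = coverage_probability(N_CLONES, INSERT_BP, ASSUMED_GENOME_BP)
    print(f"coverage probability: {p:.6f}")
    print("clones needed for 99.99%:",
          clones_needed(0.9999, INSERT_BP, ASSUMED_GENOME_BP))
```

Under these assumptions the picked colonies exceed the number required for 99.99% coverage; a larger genome or smaller inserts would raise the requirement.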

The library was screened with a 1.45 kb Ssl-MscI DNA fragment encompassing the 
ketosynthase (KS) domain from module 5 of the erythromycin PKS gene eryAIII (Donadio and 
Katz, 1992, Gene, 111:51-60). The DNA fragment was labeled with 32P using the Megaprime 
DNA labeling system (Amersham Life Science, Arlington Heights, IL). Colonies (3600) were 
transferred from 96-well plates to Hybond-N nylon membranes (Amersham Life Science, 
Arlington Heights, IL) and probed according to procedures outlined in Maniatis, et al. supra. 
Hybridization was performed at 65°C and a stringency wash carried out with 0.1x SSC at 65°C. 
About 60 cosmid clones were chosen which gave the strongest signals with this PKS probe. 

We also decided to screen Southern digests of these clones with a second probe in order to 
identify potential genetically linked peptide synthetases in this strain. The probe was designed 
from conserved motifs of nonribosomal peptide synthetases (Borchert et al., 1992, FEMS 
Microbiology Letters, 92:175-180) and consisted of a mixture of two degenerate 35-mers, SEQ 
ID NO:3 and SEQ ID NO:4. The mixed probe was labeled using DNA 5' End Labeling System 
(Promega Corp., Madison, WI). The 60 cosmid clones were digested with SmaI and run on 0.9% 
agarose gels. Southern analysis was performed according to Maniatis, et al. supra, except that 
hybridization was overnight at 55°C and the stringency wash was with 0.5x SSC at 55°C. Two 
cosmids, 54 and 58, were identified using this second probe. Thirteen additional cosmids were 
subsequently isolated by re-probing the cosmid library with a 1 kb fragment from the left of the 
insert of cosmid 58. Two of these thirteen cosmids, designated A15 and A16, were then further 
analyzed by restriction analysis and DNA sequencing. Restriction and sequence analysis of a 32.8 
kb continuous segment of DNA from A16 revealed a type I PKS cluster with four PKS modules. 
A genetic map of the cluster is shown in FIG. 6. Since an unusual CoA ligase-like domain was 
found in ORF1 (PKS1), the cluster was named "Lig-PKS". 
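
A degenerate oligonucleotide such as the 35-mers used for the second probe stands for a pool of distinct sequences, one for each combination of bases allowed at the ambiguous positions (S = C or G, R = A or G in the IUPAC code). As a minimal sketch, the following counts and enumerates that pool for the first 35-mer in the primer list, which we take to be one of the two probe oligonucleotides; the function names are hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(seq):
    """Number of distinct sequences represented by a degenerate oligo."""
    count = 1
    for base in seq.upper():
        count *= len(IUPAC[base])
    return count


def expand(seq):
    """List every concrete sequence encoded by a degenerate oligo."""
    choices = (IUPAC[base] for base in seq.upper())
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # First 35-mer from the primer list, with extraction spacing removed.
    probe = "ATCTACACSTCSGGCACSACSGGCAAGCCSAAGGG"
    print(len(probe), "nt,", degeneracy(probe), "variants")
    print(expand(probe)[0])
```

Higher degeneracy dilutes each individual sequence in the pool, which is one reason mixed probes are commonly hybridized and washed at reduced stringency, as was done here at 55°C.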

The nucleotide sequence of the LigAT2 domain from Lig-PKS (top strand) and its 
corresponding amino acid sequence (bottom strand) are shown in FIG. 7 (SEQ ID NO: 1 and 
SEQ ID NO:31 respectively). When SEQ ID NO:31 was compared with the 14 AT domains in 
the rapamycin PKS (Growtree Program, GCG, Madison WI), it was found to cluster with 
malonate-specifying rapamycin domains (see Growtree analysis of FIG. 3). Therefore, it was 
predicted that the LigAT2 specifies malonate as its cognate extender unit during synthesis of the 
polyketide encoded by Lig-PKS. 

EXAMPLE 2: Construction of plasmid pUC18/LigAT2 

Two PCR oligonucleotides (SEQ ID NO:5 and SEQ ID NO:6) were designed to subclone 
the 985 bp DNA segment encoding the LigAT2 domain from the Lig-PKS cluster and to 
introduce two unique restriction sites, AvrII and NsiI, for cassette cloning. The unique restriction 
sites AvrII and NsiI required for cassette cloning of the AT-encoding DNA were chosen based on 
multiple sequence alignment using the programs PILEUP and PRETTY (GCG, Madison WI) 
which compared the amino acid sequences of LigAT2, venAT, rapAT2, rapAT5, rapAT8, 
rapAT9, rapAT11, rapAT12, rapAT14, eryAT1, eryAT2, eryAT3, eryAT4, eryAT5, eryAT6, and 
a monofunctional AT from Streptomyces glaucescens (R.G. Summers et al., Biochemistry 
34:9389-9402 (1995)). The selection and positioning of the restriction enzyme sites were based 
on the following considerations: (i) extent of amino acid sequence conservation among the various 
ATs, with the sites being positioned outside, but near the regions of greatest conservation, (ii) 
absence of the sites from the heterologous AT-encoding DNA and the eryAT flanking DNA and 
(iii) impact of the amino acid sequence changes resulting from translation of these sites on the 
heterologous AT amino acid sequence. This necessitated nucleotide changes, shown in bold in 
FIG. 8, at the beginning and near the end of the LigAT2-encoding DNA sequence. (In FIG. 8, the 
underlined nucleotides are the wild-type sequence.) In addition, two other restriction sites, EcoRI 
and BamHI, were also introduced at the 5' ends of the N-terminal and C-terminal 
oligonucleotides, respectively, for convenient subcloning of the PCR-generated product. The 
approximately 1 kb LigAT2 domain was amplified from Cosmid 58 as follows: The 100 µL PCR 
reaction mixture contained 10 µL of 10x PCR buffer (Bethesda Research Laboratories), 2 µL of 
10 mM dNTP mixture, 2-4 µL of 50 mM MgCl2, 100 pM of each oligo, 10-50 ng of template DNA 
and water to 100 µL. Cycling conditions were as follows: One cycle at 96°C/6 min, 80°C/1 min 
(add 5 U Taq DNA Polymerase during this 1 min) and 72°C/2 min; 30 cycles at 95°C/1 min, 
65°C/1 min and 72°C/2 min with a 5 min extension at 72°C for the last cycle. The entire reaction 
was then run on a 1% agarose gel and the desired fragment was isolated with Prep-A-Gene 
(BioRad, Hercules, CA). The PCR product was digested with EcoRI and BamHI and subcloned 
into the EcoRI and BamHI sites of pUC18. The ligation mixture was transformed into E. coli 
DH5α (GIBCO BRL) according to the manufacturer's instructions and transformants were 
selected on LB plates containing 150 µg/mL ampicillin and 50 µL of a 2% solution of X-gal for 
blue/white selection. Clones were confirmed by restriction analysis and the fidelity of the insert 
was confirmed by DNA sequencing. The final plasmid construct was named pUC18/LigAT2. 
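
The engineered cloning sites described in this Example can be checked directly in the primer sequences. The sketch below scans a primer for the recognition sequences of EcoRI, BamHI, AvrII, NsiI and HindIII. The two primers are the third and fourth entries of the primer list with extraction spacing removed, which we take to be SEQ ID NO:5 and SEQ ID NO:6; the function and variable names are hypothetical.

```python
# Recognition sequences (5'->3') of the enzymes named in the Examples.
# All five are palindromic, so scanning one strand is sufficient.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "AvrII": "CCTAGG",
    "NsiI": "ATGCAT",
    "HindIII": "AAGCTT",
}


def find_sites(seq, sites=SITES):
    """Map enzyme name -> list of 0-based start positions found in seq."""
    seq = seq.upper().replace(" ", "")
    hits = {}
    for name, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)  # allow overlapping matches
        if positions:
            hits[name] = positions
    return hits


if __name__ == "__main__":
    n_terminal = "CGCGAATTCCTAGGCTGGCGGTGATGTTCA"       # assumed SEQ ID NO:5
    c_terminal = "GCCGGATCCATGCATACGTCGGCAGGGAGGTAC"    # assumed SEQ ID NO:6
    print("N-terminal primer:", find_sites(n_terminal))
    print("C-terminal primer:", find_sites(c_terminal))
```

In the N-terminal primer the EcoRI and AvrII sites share one cytosine, which is consistent with the sites being placed back to back at the 5' end of the cassette.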

EXAMPLE 3: Construction of plasmid pEryAT1/LigAT2 
pEryAT1/LigAT2 was constructed using standard methods of recombinant DNA 
technology according to the schematic outlines of FIGS. 9 and 10. To construct a 
gene-replacement vector specific for the eryAT1 domain, the two DNA regions immediately adjacent to 
eryAT1-encoding DNA were cloned and positioned adjacent to the LigAT2-encoding DNA as 
described in Example 2. The 5' and 3' boundaries of eryAT1 were designated as 3825 and 4866, 
and correspond to the deposited eryAI sequence (GenBank accession number M63676). To 
subclone the DNA fragment upstream of the eryAT1 domain encoding region from the Sac. 
erythraea chromosome, two PCR oligonucleotides (SEQ ID NO: 7 and SEQ ID NO: 8) were 
designed so that an EcoRI site was added at the 5' end of the region and AvrII-BamHI restriction 
sites were introduced at the 3' end. The 5'-flanking region (about 1 kb) was PCR generated as 
described in EXAMPLE 2 using plasmid pAIEN22 DNA as template. (This plasmid is a pUC19 
derivative containing 22 kb of Sac. erythraea DNA from an EcoRI site upstream of eryAI to an 
NheI site in eryAII cloned into EcoRI and XbaI cut pUC19). The PCR product was subcloned into 
EcoRI and BamHI sites of pUC19 and the ligated DNA transformed into E. coli DH5α (GIBCO 
BRL) according to the manufacturer's instructions. Clones were selected on LB plates containing 
150 µg/mL ampicillin and 50 µL of a 2% solution of X-gal for blue/white selection. Clones were 
confirmed by restriction analysis and the fidelity of the insert was confirmed by DNA sequencing. 
The resulting construct was named pUC19/AT1/5'-flank. 

For subcloning the 3'-flanking region of the eryAT1 from the Sac. erythraea chromosome, 
two PCR oligonucleotides (SEQ ID NO: 9 and SEQ ID NO: 10) were designed so that BamHI-NsiI 
restriction sites were introduced into the 5' end of the region and a HindIII restriction site was 
added to the 3' end. The 3'-flanking region (about 1 kb) was also generated by PCR using 
pAIEN22 as template as described above. The PCR fragment was subcloned into the BamHI and 
HindIII sites of pUC19 and the ligated DNA transformed into E. coli DH5α as above. Clones 
were selected on LB plates containing 150 µg/mL ampicillin and 50 µL of a 2% solution of X-gal 
for blue/white selection. Clones were confirmed by restriction analysis and the fidelity of the 
insert was confirmed by DNA sequencing. The BamHI-HindIII insert of this intermediate construct 
was then ligated to pUC19/AT1/5'-flank cut with BamHI and HindIII. Ligated DNA was transformed 
into E. coli DH5α and clones isolated as described. The resulting plasmid was named pUC19/AT1-flank. 
The 2.1 kb EcoRI and HindIII fragment from pUC19/AT1-flank was then isolated and ligated to 
pCS5 cut with the same enzymes to generate pCS5/AT1-flank. The final step in the construction 
of pEryAT1/LigAT2 was to ligate the 1 kb LigAT2 fragment having AvrII and NsiI ends to 
pCS5/AT1-flank cut with the same enzymes to give the gene replacement/integration plasmid 
pEryAT1/LigAT2. All ligation mixtures were transformed into the intermediate host E. coli 
DH5α and clones selected as previously described. 

EXAMPLE 4: Construction of Sac. erythraea ER720 EryAT1/LigAT2 
An example of a 12-desmethyl-12-deoxyerythromycin A-producing microorganism was 
prepared by replacing the DNA fragment encoding the methylmalonyl acyltransferase domain in 
module 1 of the erythromycin PKS (EryAT1) of Sac. erythraea ER720 with a newly discovered 
DNA fragment encoding a malonyl acyltransferase domain (LigAT2) from S. hygroscopicus 
ATCC 29253. This was accomplished with the recombinant plasmid pEryAT1/LigAT2, prepared as 
described in Example 3. Transformation of Sac. erythraea ER720 and resolution of the 
integration event were carried out according to the following method. Sac. erythraea ER720 cells 
were grown in 50 mL of SGGP medium (per 1 liter aqueous solution: 4 g peptone, 4 g yeast 
extract, 4 g casamino acids, 2 g glycine, 0.5 g MgSO4 • 7 H2O, 10 g glucose, 20 mL of 500 mM 
KH2PO4) for 3 days at 32°C and then washed in 10 mL of 10.3% sucrose. The cells were 
resuspended in 10 mL of PM buffer containing 1 mg/mL lysozyme and incubated at 30°C for 
15-30 minutes until most of the mycelial segments were converted into spherical protoplasts. (PM 
buffer per 1 liter aqueous solution: 200 g sucrose, 0.25 g K2SO4 in 890 mL H2O, with the addition 
after sterilization of 100 mL 0.25 M TES, pH 7.2, 2 mL trace elements solution (Hopwood, et al., 
1985, Genetic Manipulation of Streptomyces A Laboratory Manual, The John Innes Foundation), 
0.08 mL 2.5 M CaCl2, 10 mL 0.5% KH2PO4, 2 mL 2.5 M MgCl2.) The protoplasts were washed 
once with PM and then resuspended in 3 mL of the same buffer containing 10% DMSO for storage 
in 200 µL aliquots at -80°C. 

Transformation was accomplished by quickly thawing an aliquot of protoplasts, 
centrifuging for 15 seconds in a microfuge, decanting the supernatant, and resuspending the 
protoplasts in the PM remaining in the tube. Ten µL of DNA solution was added (3 µL of 
pEryAT1/LigAT2 DNA from Example 3 at about 1 µg/µL in 7 µL of PM buffer) and mixed with 
the protoplasts by gently tapping the tube. Two tenths of a mL of 25% PEG 8000 in T buffer 
(Hopwood, et al., 1985, Genetic Manipulation of Streptomyces A Laboratory Manual, The John 
Innes Institute) was then added, mixed by pipetting the solution 3 times and the suspension 
immediately spread on a dried R3M plate. The plate was incubated at 30°C for 20 hours and 
overlaid with 2 mL of water containing 100 µg/mL thiostrepton, dried briefly and incubated 4 
more days at 30°C. 

To select stable transformants (integrants), colonies arising on the transformation plates 
were re-streaked onto R3M plates containing thiostrepton (20 µg/mL). Two colonies were 
confirmed to be thiostrepton resistant and one of these was inoculated into SGGP containing 
thiostrepton (10 µg/mL) to isolate chromosomal DNA for Southern analysis. Integration of the 
plasmid DNA into the ER720 chromosome was further confirmed by Southern hybridization (data 
not shown). Hybridization was at 65°C and the stringency wash was with 0.1x SSC at 65°C. 

The confirmed integrant was grown in SGGP without antibiotic for four days and then 
plated onto non-selective R3M plates for sporulation. Spores were plated on R3M plates to obtain 
individual colonies, which were then screened for sensitivity to thiostrepton, indicating loss of the 
plasmid sequence from the chromosome. Five thiostrepton sensitive colonies were selected and 3 
of these were confirmed by Southern hybridization to have the EryAT1 replaced by LigAT2 
(FIG. 11). Hybridization was at 65°C and the stringency wash was with 0.1x SSC at 65°C. The 
strain was named Sac. erythraea ER720 EryAT1/LigAT2. 

EXAMPLE 5: Analysis of compounds produced by Sac. erythraea ER720 EryAT1/LigAT2 

Compounds produced by the recombinant Sac. erythraea strain, ER720 EryAT1/LigAT2, 
whose construction is described in Example 4, were characterized by TLC, bioautography, mass 
spectrometry and NMR analysis. 

For TLC analysis, cells were grown in either SGGP or SCM medium (20 g Soytone, 15 g 
Soluble Starch, 10.5 g MOPS, 1.5 g Yeast Extract and 0.1 g CaCl2 per liter of distilled H2O) for 4-5 
days at 30°C. 1.5 mL of culture was centrifuged for 1 minute in a microfuge to remove cells. One 
mL of the resulting supernatant was removed to another microfuge tube and the pH adjusted to 9.0 
by the addition of 6 µL of NH4OH. Then 0.5 mL of ethyl acetate was added, the tube was 
vortexed for 10 sec and then centrifuged for approximately 5 min to achieve phase separation. 
The organic phase was removed to another tube, and the aqueous phase was re-extracted with 0.5 
mL of ethyl acetate. The second organic phase was combined with the first and dried in a Speed 
Vac. The residue was taken up in 10 µL of ethyl acetate and 5 µL was spotted onto a Merck 
60F-254 silica gel TLC plate. The plate was run in isopropyl ether:methanol:NH4OH (75:35:2). 
Erythromycin derivatives were visualized by spraying the plates with anisaldehyde:sulfuric 
acid:ethanol (1:1:9). Using this reagent, a novel compound predicted to be 
12-desmethyl-12-deoxyerythromycin A appeared as a blue spot running slightly faster than erythromycin A (FIG. 
12). 

To detect biological activity, a TLC bioautography assay was performed. In this assay, 
one microliter of the extracted sample from above was spotted onto a TLC plate which was run as 
described above. The plate was then air-dried and placed in a sterile bio-assay dish (245x245x25 
mm). The plate was then covered with 100 mL of antibiotic medium 11 (DIFCO-BACTO) 
containing Staphylococcus aureus as an indicator strain and incubated overnight at 37°C. As with 
the positive controls, a clear zone of inhibition developed around the sample spot indicating that 
the novel compound had bioactivity. 

To determine whether the novel spot seen on TLC had the molecular mass corresponding 
to the predicted 12-desmethyl-12-deoxyerythromycin A, an ethyl acetate extract was further 
analyzed by mass spectrometry. The mass spectrometry samples were isolated by TLC essentially 
as described above except that plates were not sprayed with the anisaldehyde reagent. The region 
of the novel spot was instead scraped from the TLC plate and the silica resin re-extracted with 
ethyl acetate-methanol (1:1) and then twice with ethyl acetate. The combined solvent phases were 
then dried in a Speed Vac. Mass spectrometric analysis revealed the novel compound to have a 
mass of 704, which corresponds to the protonated molecular ion (M+H+) of 
12-desmethyl-12-deoxyerythromycin A. 
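
The observed mass of 704 can be checked against the expected composition. The sketch below starts from the molecular formula of erythromycin A (C37H67NO13), removes one CH2 unit (methyl replaced by hydrogen) and one oxygen (hydroxyl replaced by hydrogen), and computes the monoisotopic m/z of the protonated ion. The derived formula is our inference from the compound name rather than a value stated in the text, and the function names are hypothetical.

```python
# Monoisotopic atomic masses (u) and the mass of a proton.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727647

ERYTHROMYCIN_A = {"C": 37, "H": 67, "N": 1, "O": 13}


def subtract(formula, loss):
    """Return a new formula with the atoms in `loss` removed."""
    result = dict(formula)
    for element, count in loss.items():
        result[element] = result.get(element, 0) - count
        if result[element] < 0:
            raise ValueError(f"cannot remove {count} {element}")
    return result


def monoisotopic_mass(formula):
    """Sum of monoisotopic atomic masses for a formula dictionary."""
    return sum(MONOISOTOPIC[el] * n for el, n in formula.items())


def mz_protonated(formula):
    """m/z of the singly charged [M+H]+ ion."""
    return monoisotopic_mass(formula) + PROTON


if __name__ == "__main__":
    # Desmethyl: CH3 -> H is a net loss of CH2; deoxy: OH -> H is a net loss of O.
    analog = subtract(subtract(ERYTHROMYCIN_A, {"C": 1, "H": 2}), {"O": 1})
    print("analog formula:", analog)
    print(f"[M+H]+ erythromycin A: {mz_protonated(ERYTHROMYCIN_A):.3f}")
    print(f"[M+H]+ analog:         {mz_protonated(analog):.3f}")
```

The 30 u difference between the two protonated ions corresponds to the combined loss of a methyl and a hydroxyl group relative to erythromycin A.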

To acquire milligram quantities of highly purified material for performance of NMR 
analysis, the culture was grown in a 42-liter LH Fermentation Series 2000 fermentor. SCM 
medium was used for growth of inoculum and for the fermentation. Seed for the fermentation was 
grown in two steps. In the first step, frozen vegetative inoculum was used to seed 100 mL of 
SCM medium in a 500 mL Erlenmeyer flask. In the second step, Erlenmeyer flasks containing 
600 mL of SCM medium were seeded at 
5% from the first passage growth. Each step was incubated for 3 days at 32°C on a rotary shaker 
operated at 225 rpm. 
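
The seed train follows a fixed inoculation ratio: second-passage flasks are seeded at 5%, and the 1.5 liters transferred into thirty liters of medium in the fermentor is likewise 5% by volume. A minimal sketch of that arithmetic follows; the function names are hypothetical.

```python
def inoculum_volume(culture_volume, fraction):
    """Volume of seed culture needed to inoculate at the given v/v fraction."""
    if not 0 < fraction < 1:
        raise ValueError("fraction must be between 0 and 1")
    return culture_volume * fraction


def inoculum_fraction(seed_volume, culture_volume):
    """Inoculation ratio (v/v) given seed and receiving culture volumes."""
    return seed_volume / culture_volume


if __name__ == "__main__":
    # Second passage: 600 mL flasks seeded at 5% from the first passage.
    print(f"seed per 600 mL flask: {inoculum_volume(600, 0.05):.0f} mL")
    # Fermentor: 1.5 L of second-passage seed into 30 L of SCM medium.
    print(f"fermentor inoculation: {inoculum_fraction(1.5, 30) * 100:.1f}% v/v")
```

Each 100 mL first-passage flask therefore supplies enough seed for up to three 600 mL second-passage flasks at this ratio.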

Thirty liters of SCM medium were prepared in the 42-liter fermentor and sterilized at 
121°C and 15 psi for 1 hour. Antifoam (XFO-371, Ivanhoe Chemical Co., Mundelein, IL) was 
added initially at 0.01% and then was available on demand. The fermentor was inoculated with 
1.5 liters of the second passage seed growth. The temperature was controlled at 32°C. The
agitation rate was 260 rpm and the air flow was 1.3 vol/vol/min. The head pressure was
maintained at 6 psi. During fermentation pH was controlled at 7.3 with 5 M propionic acid. The
fermentation was terminated at 111 hours, and the fermentation beer was adjusted to pH 8. This
was followed by two extractions with equal volumes of CH2Cl2. The pooled CH2Cl2 extract was
then concentrated to approximately 400 mL and extracted twice with equal volumes of 0.05 M
aqueous potassium phosphate pH 5.5. The aqueous phase was pooled and adjusted to pH 8, and
then extracted twice with equal volumes of ethyl acetate. The ethyl acetate extracts were pooled
and concentrated to yield 5 mL of oil. The extraction sequence described above was then repeated to
yield 600 mg of oil after concentration. Next, the sample was split and each half was digested in
2.5 mL each of the upper and lower phases of a solvent system consisting of n-hexane, ethyl
acetate, and 0.02 M aqueous potassium phosphate, pH 8 (1:1:1, v/v/v). These were then
chromatographed on the Coil Planet Centrifuge using the upper phase as the mobile phase.
Fractions were analyzed by bioassay against Staphylococcus aureus and by 1H NMR. Two
macrolide-containing peaks of bioactivity were observed and the later eluting species was readily
characterized by its 1H and 13C NMR spectra as 12-desmethyl-12-deoxyerythromycin A.
Parameters from the 1H NMR spectra are listed in Table 2. The assignments were made with the
aid of correlation spectroscopy (COSY), heteronuclear multiple quantum correlation (HMQC),
heteronuclear multiple bond correlation (HMBC), and distortionless enhancement by polarization
transfer (DEPT) experiments. Mass spectral data for this sample were also consistent with the
structural assignment. Electrospray ionization (ESI) of this sample revealed an M+H+ ion at m/z
704, which is in full accord with erythromycin A lacking both a methyl group and a hydroxyl
group.
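
The scale-up figures quoted for this fermentation can be checked with simple arithmetic. The following is a minimal sketch in Python; the function names are illustrative only, and it assumes that "vol/vol/min" means volumes of air per volume of medium per minute and that percentages are taken relative to the volume of medium being seeded.

```python
def inoculum_percent(seed_volume_l: float, medium_volume_l: float) -> float:
    """Inoculum size expressed as a percentage of the medium volume."""
    return 100.0 * seed_volume_l / medium_volume_l


def air_flow_l_per_min(vvm: float, medium_volume_l: float) -> float:
    """Absolute air flow for an aeration rate given in vol/vol/min."""
    return vvm * medium_volume_l


# 1.5 liters of second passage seed into 30 liters of SCM medium
fermentor_inoculum = inoculum_percent(1.5, 30.0)

# Seed volume needed for one 600 mL flask inoculated at 5%
flask_seed_ml = 0.05 * 600.0

# Air flow implied by 1.3 vol/vol/min on 30 liters of medium
air_flow = air_flow_l_per_min(1.3, 30.0)

print(f"fermentor inoculum: {fermentor_inoculum:.1f}%")
print(f"seed per 600 mL flask: {flask_seed_ml:.0f} mL")
print(f"air flow: {air_flow:.0f} L/min")
```

Under these assumptions the fermentor inoculum works out to the same 5% ratio used for the second seed passage.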



Table 2

1H NMR chemical shift (δ) assignments for 12-desmethyl-12-deoxyerythromycin A in CDCl3

Position    δ       Position    δ
2-H         2.74    1'-H        4.47
3-H         4.15    2'-H        3.25
4-H         2.01    3'-H        2.49
5-H         3.58    4'-Ha       1.67
7-Ha        1.91    4'-Hb       1.23
7-Hb        1.66    5'-H        3.54
8-H         2.86    6'-H3       1.23
10-H        2.70    N(CH3)2     2.30
11-H        4.05    1"-H        4.85
12-Ha       1.71    2"-Ha       2.40
12-Hb       1.46    2"-Hb       1.59
13-H        5.06    4"-H        3.03
14-H2       1.59    5"-H        4.04
15-H3       0.89    6"-H3       1.30
2-CH3       1.19    3"-CH3      1.25
4-CH3       1.13    OCH3        3.33
6-CH3       1.38
8-CH3       1.19
10-CH3      1.11







EXAMPLE 6: Construction of plasmid pEryAT2/LigAT2 
pEryAT2/LigAT2 was constructed using standard methods of recombinant DNA 
technology. To make a gene-replacement vector specific for the eryAT2 domain, two DNA 
regions flanking eryAT2 were cloned and positioned adjacent to the DNA encoding the domain to 
be inserted in order to effect homologous recombination. Boundaries of the AT2 domain were 
chosen as described in Example 2. The 5' and 3' boundaries of eryAT2 are designated as 8255
and 9282, respectively, and correspond to the deposited eryAI sequence (GenBank accession number
M63676). To subclone the DNA fragment upstream of the eryAT2 DNA, two PCR
oligonucleotides (SEQ ID NO: 11 and SEQ ID NO: 12) were designed so that a HindIII site was
added at the 5' end of the region and AvrII-PstI restriction sites were introduced at the 3' end. For
subcloning the 3'-flanking region of eryAT2, two PCR oligonucleotides (SEQ ID NO: 13 and
SEQ ID NO: 14) were designed so that PstI-NsiI restriction sites were introduced at the 5' end of
the region and an EcoRI site at the 3' end. Both the 5'-flanking and 3'-flanking regions (about 1
kb each) were PCR generated as described in Example 3. In the case of the 5'-flanking region, the
PCR product was subcloned into the HindIII and PstI sites of pUC18; the PCR product of the
3'-flanking region was subcloned into the PstI and EcoRI sites of pUC18.
Ligations, transformations and confirmations of selected clones were performed as in Example 3. 
The resulting construct containing the AT2 5'-flanking region was designated pUC18/AT2/5'-flank
and the construct containing the AT2 3'-flanking region was designated pUC18/AT2/3'-flank.
The two flanking regions were then joined by first isolating the 2 kb PstI and EcoRI
fragment (3'-flank) from pUC18/AT2/3'-flank, and ligating this fragment to pUC18/AT2/5'-flank
cut with PstI and EcoRI. The ligation was transformed into E. coli DH5α and clones isolated as
described. The resulting plasmid was named pUC18/AT2-flank (FIG. 13). The 2.2 kb EcoRI and
HindIII fragment from pUC18/AT2-flank was then isolated and ligated to pCS5 cut with the same
enzymes to generate pCS5/AT2-flank. The final step in the construction of pEryAT2/LigAT2 was
to ligate the LigAT2-encoding DNA fragment from pUC18/LigAT2 having AvrII and NsiI ends
(described in Example 2) to pCS5/AT2-flank cut with the same enzymes to give the gene
replacement/integration plasmid pEryAT2/LigAT2 (FIG. 14). All ligations were transformed into
the intermediate host E. coli DH5α and clones selected as previously described.



EXAMPLE 7: Construction of Sac. erythraea ER720 EryAT2/LigAT2
An example of a 10-desmethylerythromycin A and 10-desmethyl-12-deoxyerythromycin A
producing microorganism was prepared by replacing the methylmalonyl acyltransferase domain of
module 2 of the erythromycin PKS (EryAT2) of Sac. erythraea ER720 with a newly discovered
malonyl acyltransferase domain (LigAT2) from S. hygroscopicus ATCC 29253. This was
accomplished with the recombinant plasmid, pEryAT2/LigAT2, prepared as described in Example
6. Transformation of ER720 and resolution of the integration event were carried out according to
the procedures described in Example 4 using 10 µL of a DNA solution consisting of 3 µL of
pEryAT2/LigAT2 DNA from Example 6 at about 1 µg/µL in 7 µL of PM buffer. Three colonies were
confirmed to be thiostrepton resistant and were inoculated into SGGP containing thiostrepton
(10 µg/mL) to isolate chromosomal DNA for Southern analysis. Integration of the plasmid DNA
into the ER720 chromosome was further confirmed by Southern hybridization (data not shown).
Hybridization was at 65°C and the stringency wash was with 0.1 x SSC at 65°C.

The confirmed integrant was grown in SGGP without antibiotic for four days and then 
plated onto non-selective R3M plates for sporulation. Spores were plated on R3M plates to obtain 
individual colonies, which were then screened for sensitivity to thiostrepton, indicating loss of the 
plasmid sequence from the chromosome. Two thiostrepton sensitive colonies were selected and 
one of these was confirmed by Southern hybridization to have the EryAT2 replaced by LigAT2 
(FIG. 15). Hybridization was at 65°C and the stringency wash was with 0.1 x SSC at 65°C. The 
strain was named Sac. erythraea ER720 EryAT2/LigAT2. 

EXAMPLE 8: Analysis of compounds produced by Sac. erythraea ER720 EryAT2/LigAT2
Compounds produced by the recombinant Sac. erythraea strain, ER720 EryAT2/LigAT2, 
whose construction is described in Example 7, were characterized by TLC, bioautography and 
mass spectrometry. 

For small scale analysis, the cells were grown in either SGGP or SCM medium for 4-5
days at 30°C. 15 mL of culture was centrifuged for 10 minutes in a Sorvall GLC-4 General
Laboratory Centrifuge at setting 10 to remove cells. Ten mL of the resulting supernatant was
adjusted to pH 9.0 by the addition of 60 µL of NH4OH. Then 5 mL of ethyl acetate was added, the
tube was shaken vigorously for 3 minutes and then centrifuged for approximately 5 min to achieve
phase separation. The organic phase was removed to another tube, and the aqueous phase was
re-extracted with 5 mL of ethyl acetate. The second organic phase was combined with the first and
dried in a Speed Vac. The residue was taken up in 20 µL of ethyl acetate and 10 µL was spotted
onto a Merck 60F-254 silica gel TLC plate. The plate was run in isopropyl ether:methanol:NH4OH
(75:35:2). Erythromycin derivatives were visualized by spraying the plates with
anisaldehyde:sulfuric acid:ethanol (1:1:9). Using this reagent, two novel compounds predicted to
be 10-desmethylerythromycin A and 10-desmethyl-12-deoxyerythromycin A appeared as blue
spots with the lower spot running slightly slower than erythromycin A and the upper spot running
slightly faster than erythromycin A (FIG. 16).

To detect biological activity, a TLC-bioautography assay was performed. In this assay,
0.2 to 1 µL of the extracted sample from above was spotted onto a TLC plate which was
run as described. The plate was then air-dried and placed in a sterile bio-assay dish (245x245x25
mm). The plate was then covered with 100 mL of antibiotic medium 11 (DIFCO-BACTO)
containing Staphylococcus aureus as an indicator strain. The inhibition zones were developed by
overnight incubation of the plate at 37°C. As shown in FIG. 17 (TLC-bioautography), the two
novel spots (compounds) each have bioactivity against Staphylococcus aureus.

To determine whether the novel spots seen on TLC had the molecular masses
corresponding to the predicted 10-desmethylerythromycin A and
10-desmethyl-12-deoxyerythromycin A, an ethyl acetate extract was further analyzed by mass
spectrometry. The mass spectrometry samples were isolated by TLC similarly to the method
described above except that plates were not sprayed with the anisaldehyde reagent. Instead, two
regions which contain the novel spots were scraped from the TLC plate and the silica resin
re-extracted with ethyl acetate-methanol (1:1) and then twice with ethyl acetate. The combined
solvent phases were then dried in a Speed Vac. In addition to the samples described above, a crude
ethyl acetate extract was also analyzed by LC-MS, in which the sample components were first
separated by liquid chromatography and then analyzed by mass spectrometry. Mass spectrometric
analysis revealed the two novel compounds to have masses of 720 and 704, which correspond to
the molecular ion plus a proton (M+H+) of 10-desmethylerythromycin A and
10-desmethyl-12-deoxyerythromycin A, respectively.
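
The correspondence between the observed masses and the predicted structures can be illustrated with a short calculation. The sketch below is in Python and uses nominal (integer) atomic masses. The molecular formula of erythromycin A, C37H67NO13, is not stated in this text and is taken here as an assumption from standard references. The helper names are illustrative only.

```python
# Nominal atomic masses; sufficient for comparison with unit-resolution m/z values.
NOMINAL_MASS = {"C": 12, "H": 1, "N": 14, "O": 16}

# Assumed molecular formula of erythromycin A (not given in the text above).
ERYTHROMYCIN_A = {"C": 37, "H": 67, "N": 1, "O": 13}

DESMETHYL = {"C": -1, "H": -2}  # CH3 replaced by H: net loss of CH2
DEOXY = {"O": -1}               # OH replaced by H: net loss of O


def modify(formula: dict, delta: dict) -> dict:
    """Return a copy of `formula` with the element counts in `delta` added."""
    out = dict(formula)
    for element, count in delta.items():
        out[element] = out.get(element, 0) + count
    return out


def mh_ion(formula: dict) -> int:
    """Nominal m/z of the singly protonated molecule (M+H+)."""
    return sum(NOMINAL_MASS[el] * n for el, n in formula.items()) + 1


ery_a = mh_ion(ERYTHROMYCIN_A)
desmethyl = mh_ion(modify(ERYTHROMYCIN_A, DESMETHYL))
desmethyl_deoxy = mh_ion(modify(modify(ERYTHROMYCIN_A, DESMETHYL), DEOXY))

print(ery_a, desmethyl, desmethyl_deoxy)
```

A calculation of this kind does not distinguish positional isomers: the 10-desmethyl and 12-desmethyl analogs have the same nominal mass, so the position of the change rests on the genetic construction and on the NMR data.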



EXAMPLE 9: Cloning of the venAT Domain from Streptomyces venezuelae 
A genomic library of Streptomyces venezuelae ATCC 15439 DNA was constructed in the
bifunctional cosmid pNJ1 (Tuan, et al., Gene 90: 21-29 (1990)) using standard methods of
recombinant DNA technology. A cosmid from this library, pVen17, was characterized by
Southern analysis and SstI fragments of approximately 3.5, 3.8, and 4.0 kb were found to
hybridize to a 1.37 kb SmaI fragment that encompasses the ketosynthase (KS) domain from
module 2 of the erythromycin PKS gene eryAI (Donadio et al., Science 252:675-679 (1991)). The
4.0 kb SstI fragment was then subcloned into pUC19 to give pVen4.0. The nucleotide sequence of
pVen4.0 insert DNA was determined from single strand DNA templates prepared from M13mp18
and M13mp19 (Yanisch-Perron, et al., Gene, 33:103 (1985)) subclones using Sequenase version
2.0 with 7-deaza-dGTP (United States Biochemical, Cleveland, OH) and 5'-[α-32P] or
5'-[α-33P]-dCTP (NEN Research Products, Boston, MA). Because pVen4.0 did not contain the entire AT
domain, the nucleotide sequence was extended using pVen17 DNA as the template. The
nucleotide sequence of the venAT domain (SEQ ID NO: 2) and its corresponding amino acid
sequence (SEQ ID NO: 32) are shown in FIG. 18 (top and bottom strands respectively).

EXAMPLE 10: Construction of plasmid pEryAT1/venAT
pEryAT1/venAT was constructed using standard methods of recombinant DNA
technology according to the schematic outlines of FIGS. 23 and 24. Two PCR oligonucleotides
(SEQ ID NO: 15 and SEQ ID NO: 16) were designed to subclone the 1.03 kb DNA fragment that
encodes the venAT domain (FIG. 19) from the S. venezuelae PKS cluster and to introduce two
unique restriction sites, AvrII and NsiI, for cassette cloning (described in Example 2). This
necessitated nucleotide changes (shown in bold in FIG. 19) at the beginning and near the end of the
venAT sequence (underlined nucleotides are the wild-type sequence). In addition, two other
restriction sites, EcoRI and BamHI, were also introduced at the 5' ends of the N-terminal and
C-terminal oligonucleotides, respectively, for convenient subcloning of the PCR-generated product.
The approximately 1 kb venAT-encoding DNA was PCR amplified from cosmid pVen17 template
DNA (Example 2) using VentR® DNA Polymerase (New England Biolabs). A typical PCR
reaction contained 10 µL of 20% glycerol, 55 µL water, 100 pmole of each primer, digested
pUC18 and transformed into E. coli DH5α (GIBCO BRL) according to the manufacturer's
instructions. Clones were selected on LB plates containing 150 µg/mL ampicillin and 50 µL of a
2% solution of X-gal for blue/white selection. Clones were confirmed by restriction analysis and
the fidelity of the insert was confirmed by DNA sequencing. The final construct was named
pUC18/venAT.

The final step in the construction of pEryAT1/venAT was to ligate the 1 kb venAT
fragment having AvrII and NsiI ends to pCS5/AT1-flank (Example 3) cut with the same enzymes
to give the gene replacement/integration plasmid pEryAT1/venAT (FIG. 20). All ligations were
transformed into the intermediate host E. coli DH5α and clones selected as previously described.

EXAMPLE 11: Construction of Sac. erythraea ER720 EryAT1/venAT
A 12-desmethyl-12-deoxyerythromycin A producing microorganism was prepared by
replacing the methylmalonyl acyltransferase domain of module 1 of the erythromycin PKS
(EryAT1) of Sac. erythraea ER720 with a newly discovered malonyl acyltransferase domain
(venAT) from S. venezuelae ATCC 15439. This was accomplished with the recombinant plasmid,
pEryAT1/venAT, prepared as in Example 10. Transformation of ER720 and resolution of the
integration event were carried out as described in Example 4 using 10 µL of DNA solution
consisting of 3 µL of pEryAT1/venAT DNA at about 1 µg/µL in 7 µL of PM buffer. One
thiostrepton resistant colony was isolated and was inoculated into SGGP containing thiostrepton
(10 µg/mL) to isolate chromosomal DNA for Southern analysis. Integration of the plasmid DNA
into the ER720 chromosome was further confirmed by Southern hybridization (data not shown).
Hybridization was at 65°C and the stringency wash was with 0.1 x SSC at 65°C.

The confirmed integrant was grown in SGGP without antibiotic for four days and then
diluted 1000 fold into fresh medium and grown for 4 more days. Cells were then plated onto
non-selective R3M plates for sporulation. Spores were plated on R3M plates to obtain individual
colonies, which were then screened for sensitivity to thiostrepton, indicating loss of the plasmid
sequence from the chromosome. Four thiostrepton sensitive colonies were selected and two of these
were confirmed by Southern hybridization, using conditions described above, to have the EryAT1
replaced by venAT (FIG. 21). The strain was named Sac. erythraea ER720 EryAT1/venAT.

EXAMPLE 12: Analysis of compounds produced by Sac. erythraea ER720 EryAT1/venAT
Compounds produced by the recombinant Sac. erythraea strain, ER720 EryAT1/venAT,
whose construction is described in Example 11, were characterized by TLC, bioautography, and
mass spectrometry.

For TLC analysis cells were grown in either SGGP or SCM medium (20 g Soytone, 15 g
Soluble Starch, 10.5 g MOPS, 1.5 g Yeast Extract and 0.1 g CaCl2 per liter of distilled H2O) for
4-5 days at 30°C. The culture (1.5 mL) was centrifuged for 1 minute in a microfuge to remove cells.
One mL of the resulting supernatant was removed to another microfuge tube and the pH adjusted
to 9.0 by the addition of 6 µL of NH4OH. Then 0.5 mL of ethyl acetate was added, the tube was
vortexed for 10 sec and then centrifuged for approximately 5 min to achieve phase separation.
The organic phase was removed to another tube, and the aqueous phase was re-extracted with 0.5
mL of ethyl acetate. The second organic phase was combined with the first and dried in a Speed
Vac. The residue was taken up in 10 µL of ethyl acetate and the entire sample was spotted onto a
Merck 60F-254 silica gel TLC plate. The plate was run in isopropyl ether:methanol:NH4OH
(75:35:2). Erythromycin derivatives were visualized by spraying the plates with
anisaldehyde:sulfuric acid:ethanol (1:1:9). Using this reagent, a novel compound predicted to be
12-desmethyl-12-deoxyerythromycin A appeared as a blue spot running slightly faster than
erythromycin A (FIG. 22).

To detect biological activity, a TLC-bioautography assay was performed. In this assay, 
one µL of an extract prepared as above was spotted onto a TLC plate which was run as described
above. The plate was then air-dried, placed face down on top of 100 mL of antibiotic medium 11
(DIFCO-BACTO) containing Staphylococcus aureus as an indicator strain in a sterile bio-assay 
dish (245x245x25 mm) and incubated overnight at 37°C. As with the positive controls, a clear 
zone of inhibition developed around the sample spot indicating that the novel compound was 
bioactive. 

To determine whether the novel spot seen on TLC had the molecular mass corresponding 
to the predicted 12-desmethyl-12-deoxyerythromycin A, an ethyl acetate extract was further 
analyzed by mass spectrometry. The mass spec samples were isolated by TLC basically as 
described above except that plates were not sprayed with anisaldehyde. The region of the novel 
spot was instead scraped from the TLC plate and the silica resin re-extracted with ethyl acetate- 
methanol (2:1) and then twice with ethyl acetate. The combined solvent phases were then dried in 
a Speed Vac. Mass spectrometric analysis revealed the novel compound to have a mass of 704, 
which corresponds to the molecular ion plus a proton (M+H+) of
12-desmethyl-12-deoxyerythromycin A.

EXAMPLE 13: Construction of plasmid pUC19/rapAT14 
Two PCR oligonucleotides (SEQ ID NO: 17 and SEQ ID NO: 18) were designed to 
subclone the 1023 bp rapAT14-encoding DNA fragment from the rapamycin biosynthetic gene
cluster (GenBank Accession #: X86780) and to introduce two unique restriction sites, AvrII and
NsiI, for cassette cloning (described in Example 2). This necessitated nucleotide changes (shown
in bold in FIG. 23) at the beginning and near the end of the rapAT14 sequence. (In FIG. 23, the
underlined nucleotides are the wild-type sequence.) In addition, two other restriction sites, EcoRI
and HindIII, were also introduced at the 5' ends of the N-terminal and C-terminal
oligonucleotides, respectively, for convenient subcloning of the PCR-generated product. The
approximately 1 kb rapAT14-encoding DNA was amplified by PCR using chromosomal DNA
from Streptomyces hygroscopicus ATCC 29253 as template. The PCR conditions were as
follows: The 100 µL reaction mixture contained 10 µL of 10x Thermopol Buffer (New England
Biolabs), 2% glycerol, 10% formamide, 100 pmoles of each oligo, 100-200 ng of template DNA
and water to 84 µL. The sample was then heated to 99°C for two minutes followed by cooling to
80°C for two minutes at which time 16 µL of a dNTP solution (1.25 mM dATP and dTTP, 1.5
mM dCTP and dGTP) and 1 µL of VentR® DNA Polymerase (New England Biolabs) were added.
Cycling was as follows: 30 cycles at 96.5°C/35 sec, 65°C/1 min and 72°C/1.5 min followed by
one cycle at 72°C for 3 min. The entire reaction was then run on a 1.2% low-melting agarose gel
and the desired fragment was isolated by melting the appropriate gel slice at 65°C, adding 3
volumes of TE buffer, extracting twice with phenol and once with chloroform, and ethanol
precipitating the aqueous phase. The isolated DNA was ligated directly into HincII-digested
pUC19. The ligation mixture was transformed into E. coli DH5α (GIBCO BRL) according to the
manufacturer's instructions and transformants were selected on LB plates containing 150 µg/mL
ampicillin and 50 µL of a 2% solution of X-gal for blue-white selection. Clones were confirmed
by restriction analysis and the fidelity of the insert was confirmed by DNA sequencing. The final
plasmid construct was named pUC19/rapAT14.
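
The PCR conditions given in this example imply final nucleotide concentrations and a programmed run time that are not stated explicitly. The sketch below, in Python, works them out. It assumes that the added volumes are simply additive (84 µL + 16 µL + 1 µL, which gives 101 µL rather than the nominal 100 µL) and it counts only the programmed hold times, ignoring temperature ramping and the two-minute steps before enzyme addition. The function and variable names are illustrative.

```python
def final_conc_mm(stock_mm: float, added_ul: float, total_ul: float) -> float:
    """Final concentration after dilution (C1 * V1 = C2 * V2)."""
    return stock_mm * added_ul / total_ul


# Volumes from the protocol: mix brought to 84 uL, then dNTPs and polymerase added.
total_ul = 84 + 16 + 1

datp_dttp_mm = final_conc_mm(1.25, 16, total_ul)  # dATP and dTTP
dctp_dgtp_mm = final_conc_mm(1.5, 16, total_ul)   # dCTP and dGTP

# Cycling program: 30 cycles of 35 s, 60 s and 90 s, then a 3 min final extension.
CYCLE_STEPS_S = (35, 60, 90)
N_CYCLES = 30
FINAL_EXTENSION_S = 180

programmed_s = N_CYCLES * sum(CYCLE_STEPS_S) + FINAL_EXTENSION_S

print(f"dATP/dTTP: {datp_dttp_mm * 1000:.0f} uM each")
print(f"dCTP/dGTP: {dctp_dgtp_mm * 1000:.0f} uM each")
print(f"programmed hold time: {programmed_s / 60:.1f} min")
```

The slightly higher dCTP and dGTP stock concentrations carry through to the final mix, which is consistent with amplifying a GC-rich Streptomyces template, although the text does not give this as the reason.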



EXAMPLE 14: Construction of plasmid pEryAT1/rapAT14
pEryAT1/rapAT14 was constructed using standard methods of recombinant DNA
technology according to the schematic outlines of FIG. 24. To make a gene-replacement vector
specific for the eryAT1 domain, the two DNA regions immediately adjacent to eryAT1 were
cloned and positioned adjacent to the DNA encoding the rapAT14 domain in order to allow
homologous recombination to occur. The strategy and protocol for constructing the intermediate
plasmid containing the flanking regions, pCS5/AT1-flank, are described in Example 3 and FIG. 9.
To insert the rapAT14 fragment between the flanking regions, pUC19/rapAT14 (from Example
13) was digested with NsiI and AvrII and the resulting 1 kb fragment was isolated from a 0.8%
agarose gel with Prep-A-Gene. pCS5/AT1-flank was also digested with these enzymes and the
linearized plasmid was isolated from a 0.8% agarose gel. The two fragments were ligated,
transformed into the intermediate host E. coli DH5α and ampicillin resistant clones were selected
as previously described. Insertion of the rapAT14 fragment between the ery flanking regions was
confirmed by restriction analysis and the resulting plasmid was called pEryAT1/rapAT14.
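
The cassette-cloning step described in this example can be modeled on plain sequence strings. The following Python sketch is illustrative only: the flank, stuffer and insert sequences are short hypothetical placeholders, not the actual eryAT1 flanks or rapAT14 sequence, and the function names are invented for this illustration. The AvrII (CCTAGG) and NsiI (ATGCAT) recognition sequences are the standard ones. Because both sites are regenerated on ligation, the product can be written as the vector with its AvrII-to-NsiI segment replaced by the corresponding segment of the donor.

```python
AVRII = "CCTAGG"  # AvrII recognition sequence
NSII = "ATGCAT"   # NsiI recognition sequence


def find_sites(seq: str, site: str) -> list:
    """Return every start index of `site` in `seq`, including overlapping hits."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def swap_cassette(vector: str, donor: str) -> str:
    """Replace the AvrII..NsiI segment of `vector` with the one from `donor`."""
    bounds = {}
    for name, seq in (("vector", vector), ("donor", donor)):
        avr, nsi = find_sites(seq, AVRII), find_sites(seq, NSII)
        if len(avr) != 1 or len(nsi) != 1:
            raise ValueError(f"AvrII and NsiI must each be unique in the {name}")
        if avr[0] >= nsi[0]:
            raise ValueError(f"AvrII must lie upstream of NsiI in the {name}")
        bounds[name] = (avr[0], nsi[0] + len(NSII))
    v_start, v_end = bounds["vector"]
    d_start, d_end = bounds["donor"]
    return vector[:v_start] + donor[d_start:d_end] + vector[v_end:]


# Hypothetical placeholder sequences standing in for the real DNA.
flank5, flank3 = "GATTACAGGC", "TTGACCGTAA"
vector = flank5 + AVRII + "CTGCAG" + NSII + flank3           # flanks joined by a linker
donor = "GAATTC" + AVRII + "GGTCGTCAAC" + NSII + "AAGCTT"    # AT cassette with outer sites

result = swap_cassette(vector, donor)
print(result)
```

The uniqueness check mirrors the requirement that AvrII and NsiI be unique sites; if either enzyme cut elsewhere in the flanks or the cassette, the swap would not be unambiguous.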

EXAMPLE 15: Construction of Sac. erythraea ER720 EryAT1/rapAT14
An example of a 12-desmethyl-12-deoxyerythromycin A producing microorganism was
prepared by replacing the methylmalonyl acyltransferase domain of module 1 of the erythromycin
PKS (EryAT1) of Sac. erythraea ER720 with the acyltransferase domain from module 14 of the
rapamycin PKS from S. hygroscopicus ATCC 29253. This was accomplished with the
recombinant plasmid, pEryAT1/rapAT14, prepared as described in Example 14. Transformation
of Sac. erythraea ER720 and resolution of the integration event were carried out according to the
following method. Sac. erythraea ER720 cells were grown in 50 mL of SGGP medium (per 1
liter aqueous solution: 4 g peptone, 4 g yeast extract, 4 g casamino acids, 2 g glycine, 0.5 g
MgSO4·7H2O, 10 g glucose, 20 mL of 500 mM KH2PO4) for 3 days at 32°C and then washed in
10 mL of 10.3% sucrose. The cells were resuspended in 10 mL of PM buffer containing 1 mg/mL
lysozyme and incubated at 30°C for 15-30 minutes until most of the mycelial fragments were
converted into spherical protoplasts. (PM buffer per 1 liter aqueous solution: 200 g sucrose, 0.25 g
K2SO4 in 890 mL H2O, with the addition after sterilization of 100 mL 0.25 M TES, pH 7.2, 2 mL
trace elements solution (Hopwood, et al., 1985, Genetic Manipulation of Streptomyces: A
Laboratory Manual, The John Innes Foundation), 0.08 mL 2.5 M CaCl2, 10 mL 0.5% KH2PO4, 2
mL 2.5 M MgCl2.) The protoplasts were washed once with PM and then resuspended in 3 mL of
the same buffer.
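
The SGGP recipe above is stated per liter, while the culture in this example is 50 mL. A minimal Python sketch of the scaling follows. The dictionary keys and function name are illustrative, and the calculation assumes the per-liter amounts refer to a final volume of 1 liter including the 20 mL of phosphate stock.

```python
# SGGP components per 1 liter, as listed in the example.
SGGP_PER_LITER = {
    "peptone (g)": 4.0,
    "yeast extract (g)": 4.0,
    "casamino acids (g)": 4.0,
    "glycine (g)": 2.0,
    "MgSO4.7H2O (g)": 0.5,
    "glucose (g)": 10.0,
    "500 mM KH2PO4 (mL)": 20.0,
}


def scale_recipe(per_liter: dict, volume_ml: float) -> dict:
    """Scale a per-liter recipe to the requested batch volume in mL."""
    factor = volume_ml / 1000.0
    return {name: amount * factor for name, amount in per_liter.items()}


batch = scale_recipe(SGGP_PER_LITER, 50)

# Final phosphate concentration contributed by the stock: 500 mM diluted 20 mL into 1000 mL.
phosphate_mm = 500.0 * 20.0 / 1000.0

for name, amount in batch.items():
    print(f"{name}: {amount:g}")
print(f"final KH2PO4: {phosphate_mm:g} mM")
```

The same helper applies to any of the per-liter recipes in these examples, such as SCM medium.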

Transformation was accomplished by centrifuging 200 µL of protoplasts for 15 seconds in
a microfuge, decanting the supernatant, and resuspending the protoplasts in the PM remaining in
the tube. Ten µL of DNA solution was added (3 µL of pEryAT1/rapAT14 DNA from Example 14
at about 1 µg/µL in 7 µL of PM buffer) and mixed with the protoplasts by gently tapping the tube.
Two tenths of a milliliter of 25% PEG 8000 in T buffer (Hopwood, et al., 1985, Genetic
Manipulation of Streptomyces: A Laboratory Manual, The John Innes Foundation) was then added,
mixed by pipetting the solution 3 times and the suspension immediately spread on a dried R3M
plate. The plate was incubated at 30°C for 20 hours and overlaid with 2 mL of water containing
100 µg/mL thiostrepton, dried briefly and incubated 4 more days at 30°C.

To select stable transformants (integrants), colonies arising on the transformation plates
were re-streaked onto R3M plates containing thiostrepton (20 µg/mL). Four colonies were
confirmed to be thiostrepton resistant and were inoculated into 30 mL of SGGP containing
thiostrepton (10 µg/mL). After growth for 3 days, one mL of each culture was extracted with ethyl
acetate as described in Example 5, and run on a TLC plate to confirm that the strains were no
longer making erythromycin A due to insertional inactivation by the integrating plasmid.

Integrants #1 and #4 were grown in SGGP without antibiotic for four days and then plated 
onto non-selective R3M plates for sporulation. Spores were plated on R3M plates to obtain 
individual colonies, which were then screened for sensitivity to thiostrepton, indicating loss of the 
plasmid sequence from the chromosome. Six thiostrepton sensitive colonies were isolated from 
integrant #4 and one of these (4-A-1) was confirmed by Southern hybridization to have the
EryAT1 replaced by the rapAT14 (FIG. 25). Hybridization was at 65°C and the stringency wash
was with 0.1 x SSC at 65°C. The strain was named Sac. erythraea ER720 EryAT1/rapAT14.

EXAMPLE 16: Analysis of compounds produced by
Sac. erythraea ER720 EryAT1/rapAT14
Compounds produced by the recombinant Sac. erythraea strain, ER720 EryAT1/rapAT14,
whose construction is described in Example 15, were characterized by TLC and mass
spectrometry. For TLC analysis strain 4-A-1 was grown in SCM medium (20 g Soytone, 15 g
Soluble Starch, 10.5 g MOPS, 1.5 g Yeast Extract and 0.1 g CaCl2 per liter of distilled H2O) for 4
days at 30°C. The culture (1.5 mL) was centrifuged for 1 minute in a microfuge to remove cells.
One mL of the resulting supernatant was removed to another microfuge tube and the pH adjusted
to 9 by the addition of 6 µL of NH4OH. Then 0.5 mL of ethyl acetate was added, the tube was
vortexed for 10 sec and then centrifuged for approximately 5 min to achieve phase separation.
The organic phase was removed to another tube, and the aqueous phase was re-extracted with 0.5
mL of ethyl acetate. The second organic phase was combined with the first and dried in a Speed
Vac. The residue was taken up in 13 µL of ethyl acetate and 10 µL was spotted onto a Merck
60F-254 silica gel TLC plate. The plate was run in isopropyl ether:methanol:NH4OH (75:35:2).
Erythromycin derivatives were visualized by spraying the plates with anisaldehyde:sulfuric
acid:ethanol (1:1:9). Using this reagent, a novel compound predicted to be
12-desmethyl-12-deoxyerythromycin A appeared as a blue spot running slightly faster than
erythromycin A (FIG. 26).

To determine whether the novel spot seen on TLC has the molecular mass corresponding 
to the predicted 12-desmethyl-12-deoxyerythromycin A, an ethyl acetate extract was further
analyzed by mass spectrometry. Sac. erythraea ER720 EryAT1/rapAT14 was grown in SCM
medium for 4 days. Ten mL of culture was centrifuged to remove mycelia and pH of the 
supernatant was adjusted to 9 with NH4OH. The supernatant was then extracted twice with ethyl 
acetate and the organic phases pooled and dried. As shown in FIG. 33, mass spectrometric 
analysis of this crude ethyl acetate extract shows the mass of the novel spot to be 704, which 
corresponds to the molecular ion plus a proton (M+H+) of 12-desmethyl-12-deoxyerythromycin 
A. 

EXAMPLE 17: Construction of plasmid pEryAT2/rapAT14
pEryAT2/rapAT14 was constructed using standard methods of recombinant DNA
technology according to the schematic outlines of FIGS. 15 and 34. To make a gene-replacement
vector specific for the eryAT2 domain, the two DNA regions immediately adjacent to eryAT2
were cloned and positioned adjacent to the DNA encoding the rapAT14 domain in order to allow
homologous recombination to occur. The strategy and protocol for constructing the intermediate
plasmid containing the flanking regions, pCS5/AT2-flank, are described in Example 6 and FIG.
14. The final step in the construction of pEryAT2/rapAT14 was to ligate the 1 kb
rapAT14-encoding DNA fragment having AvrII and NsiI ends to pCS5/AT2-flank (Example 6) cut with the
same enzymes to give the gene replacement/integration plasmid pEryAT2/rapAT14 (FIG. 27). All
ligations were transformed into the intermediate host E. coli DH5α and clones selected as
previously described.

EXAMPLE 18: Construction of Sac. erythraea ER720 EryAT2/rapAT14
A 10-desmethylerythromycin A and 10-desmethyl-12-deoxyerythromycin A producing
microorganism was prepared by replacing the DNA fragment encoding the methylmalonyl
acyltransferase domain of module 2 of the erythromycin PKS (EryAT2) of Sac. erythraea ER720
with a DNA fragment encoding a malonyl acyltransferase domain (rapAT14) from S.
hygroscopicus ATCC 29253. This was accomplished with the recombinant plasmid,
pEryAT2/rapAT14, prepared as described in Example 17. Transformation of ER720 and
resolution of the integration event were carried out as described in Example 4 using 10 µL of DNA
solution consisting of 3 µL of pEryAT2/rapAT14 DNA at about 1 µg/µL in 7 µL of PM buffer. One
thiostrepton resistant colony was isolated and was inoculated into SGGP containing thiostrepton
(10 µg/mL) to isolate chromosomal DNA for Southern analysis. Integration of the plasmid DNA
into the ER720 chromosome was further confirmed by Southern hybridization (data not shown).
Hybridization was at 65°C and the stringency wash was with 0.1 x SSC at 65°C.

The confirmed integrant was grown in SGGP without antibiotic for four days and then 
diluted 1000 fold into fresh medium and grown for 4 more days. Protoplasts were then prepared 
and plated onto non-selective R3M plates to obtain individual colonies, which were then screened 
for sensitivity to thiostrepton, indicating loss of the plasmid sequence from the chromosome. Four 
thiostrepton sensitive colonies were selected and three of these were confirmed by Southern
hybridization, using conditions described above, to have the EryAT2 replaced by rapAT14 (FIG.
28). The strain was named Sac. erythraea ER720 EryAT2/rapAT14.

EXAMPLE 19: Analysis of compounds produced by
Sac. erythraea ER720 EryAT2/rapAT14
Compounds produced by the recombinant Sac. erythraea strain, ER720 EryAT2/rapAT14, 
whose construction is described in Example 18, were characterized by TLC, bioassay, and mass 
spectrometry. 

For TLC analysis cells were grown in either SGGP or SCM medium (20 g Soytone, 15 g
Soluble Starch, 10.5 g MOPS, 1.5 g Yeast Extract and 0.1 g CaCl2 per liter of distilled H2O) for
4-5 days at 30°C. Culture (22 mL) was centrifuged for 5 minutes to remove cells. The resulting
supernatant was removed to another tube and the pH adjusted to 9.0 by the addition of 6 µL/mL of
NH4OH. Then an equal volume of ethyl acetate was added, the liquid was mixed for 2 min and
then centrifuged for approximately 5 min to achieve phase separation. The organic phase was
removed to another tube, and the aqueous phase was re-extracted with a half volume of ethyl
acetate. The second organic phase was combined with the first and dried in a Speed Vac. The
residue was taken up in 100 µL of ethyl acetate and one fourth of the sample was spotted onto a
Merck 60F-254 silica gel TLC plate. The plate was run in isopropyl ether:methanol:NH4OH
(75:35:2). Erythromycin derivatives were visualized by spraying the plates with
anisaldehyde:sulfuric acid:ethanol (1:1:9). Using this reagent, two novel compounds predicted to
be 10-desmethylerythromycin A and 10-desmethyl-12-deoxyerythromycin A appeared as blue
spots with the lower spot running slightly slower than erythromycin A and the upper spot running
slightly faster than erythromycin A (FIG. 29).

To detect biological activity, a bioassay was performed. In this assay, another fourth of
the extracted sample from above was spotted onto a disc. The disc was then air-dried and placed over
a plate containing 50 mL of antibiotic medium 11 (DIFCO-BACTO) containing Staphylococcus
aureus as an indicator strain. The inhibition zones were developed by overnight incubation of the 
plate at 37°C. As shown in FIG. 30, the novel compounds have bioactivity. 

To determine whether the novel spots seen on TLC have the molecular mass
corresponding to the predicted 10-desmethylerythromycin A and
10-desmethyl-12-deoxyerythromycin A, an ethyl acetate extract from another culture was further
analyzed by mass spectrometry. The sample was a crude extract of a 20 mL culture grown for 4 days.
Mass spectrometric analysis revealed the two novel compounds to have masses of 720 and 704, which
correspond to the molecular ion plus a proton (M+H+) of 10-desmethylerythromycin A and
10-desmethyl-12-deoxyerythromycin A, respectively.
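
The mass assignments above can be cross-checked with simple nominal-mass bookkeeping. The following is a minimal sketch, assuming the (M+H+) value of 734 for erythromycin A reported in Example 23 and integer nominal masses of 14 for CH2 and 16 for O. The function and variable names are illustrative only and are not part of the described method.

```python
# Nominal-mass bookkeeping for the predicted analogs (illustrative sketch).
ERY_A_MH = 734  # (M+H+) of erythromycin A, as reported in Example 23

CH2 = 14  # nominal mass lost when a methyl side chain is replaced by hydrogen
O = 16    # nominal mass lost when a hydroxyl group is replaced by hydrogen


def predicted_mh(parent_mh, *losses):
    """Return the expected (M+H+) after subtracting each nominal mass loss."""
    return parent_mh - sum(losses)


# 10-desmethylerythromycin A: one methyl group fewer than erythromycin A
desmethyl = predicted_mh(ERY_A_MH, CH2)
# 10-desmethyl-12-deoxyerythromycin A: additionally lacks the C-12 hydroxyl oxygen
desmethyl_deoxy = predicted_mh(ERY_A_MH, CH2, O)

print(desmethyl, desmethyl_deoxy)
```

Under these assumptions the two values agree with the masses of 720 and 704 observed for the novel compounds.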

Example 20: Cloning of the ethylAT Domain from Streptomyces caelestis 
A genomic library of Streptomyces caelestis NRRL-2821 (U.S. Patent 3,218,239 issued
November 16, 1965) DNA was constructed in the bifunctional cosmid pNJ1 (Tuan, et al., Gene,
90:21-29 (1990)). Cosmid vector was prepared by digesting 5 µg of pNJ1 with EcoRI,
dephosphorylating with CIAP and then digesting with BglII to generate one arm and also digesting
5 µg of pNJ1 with HindIII, dephosphorylating with CIAP and then digesting with BglII to generate
the other. Insert DNA was prepared by partially digesting approximately 5 µg of chromosomal S.
caelestis NRRL-2821 DNA with SauIIIA according to the procedure outlined in Maniatis et al.,
supra. Digestion conditions were chosen which produced fragment sizes of approximately 40 kb.
The ligation was performed by mixing approximately 1 µg of the digested chromosomal DNA with
0.5 µg of each cosmid arm. The ligation was incubated at 16°C overnight. Gigapack II XL
(Stratagene®) was used for packaging 2 µL of the ligation mix according to the manufacturer's
instructions. Transformation was done in E. coli XL1-Blue MR cells (Stratagene®). Individual
colonies were picked into thirty 96-well plates to give a 99.99% probability that the library
represented all S. caelestis NRRL-2821 genomic sequences.
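
The 99.99% figure can be sanity-checked with the standard library-representation estimate, N = ln(1 - P) / ln(1 - f), where f is the ratio of insert size to genome size. The sketch below is illustrative only. The text does not state the S. caelestis genome size, so the 8,000 kb value is an assumption chosen as typical for Streptomyces, and the function names are hypothetical.

```python
import math


def coverage_probability(n_clones, insert_kb, genome_kb):
    """Probability that any given sequence is present in a library of n_clones."""
    f = insert_kb / genome_kb
    return 1.0 - (1.0 - f) ** n_clones


def clones_needed(probability, insert_kb, genome_kb):
    """Smallest number of clones giving at least the requested probability."""
    f = insert_kb / genome_kb
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - f))


N_CLONES = 30 * 96   # thirty 96-well plates, as stated in the text
INSERT_KB = 40       # approximate insert size, as stated in the text
GENOME_KB = 8000     # ASSUMPTION: genome size is not given in the text

p = coverage_probability(N_CLONES, INSERT_KB, GENOME_KB)
n = clones_needed(0.9999, INSERT_KB, GENOME_KB)
print(f"coverage with {N_CLONES} clones: {p:.6f}")
print(f"clones needed for 99.99%: {n}")
```

With the assumed genome size, fewer than 2,880 clones are needed for 99.99% representation, which is consistent with the thirty plates picked.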

The library was screened using a probe specific for the S. caelestis NRRL-2821 PKS 
region. The probe was generated by PCR amplification of S. caelestis NRRL-2821 genomic DNA 
using degenerate primers designed from consensus ketosynthase (KS) and acyltransferase (AT) 
sequences in the GenBank database. The KS specific oligo (SEQ ID NO: 19) and the AT specific 
oligo (SEQ ID NO: 20) generated a 900 bp PCR fragment. The PCR reaction contained 10 µL
ThermoPol Buffer, 2 µL formamide, 25 µL of 20% glycerol, 3 µL 50 mM MgCl2, 45 µL water, 50
pmole of each primer, and approximately 0.2 µg DNA. The sample was heated to 99°C for 5
minutes, and then placed on ice, at which time a 10 µL cocktail consisting of 2 µL of a 10 mM
mixture of dATP, dCTP, dGTP, and dTTP, 2 units of Vent DNA polymerase, and 7 µL of water
was added. The sample was then transferred to a GeneAmp 9600 thermocycler (Perkin Elmer,
Foster City, CA) and a temperature cycle of 1 minute at 95°C, 4 minutes at 50°C, and 4 minutes at
72°C was repeated 30 times, followed by a 15 minute incubation at 72°C. The desired PCR
fragment was then isolated from 1.0% low melting agarose by standard procedures. The KS/AT
probe was made by labeling approximately 50 ng of the PCR fragment with 32P using the
Megaprime DNA Labeling System (Amersham Life Science, Arlington Heights, IL). Library
clones (2,880) were transferred from the 96-well plates to Hybond-N nylon filters (Amersham)
and screened with the KS/AT probe according to procedures in Maniatis, et al., supra.
Hybridization was performed at 65°C and the final wash was in 0.1 x SSC at 65°C. Nineteen of 
the clones hybridized strongly with the probe. These clones were then digested with SstI, run on a 
1.0% agarose gel and transferred to Hybond-N nylon filters for Southern analysis using the 
KS/AT probe (FIG. 31). The cosmid identified as pCEL18h5 was chosen for further analysis 
since it contained the largest number of hybridizing restriction fragments. 
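
As a planning aid, the hold time of the thermocycler program described above can be tallied from the stated steps. This is a minimal sketch. It counts only the programmed hold times from the text and ignores ramp times between temperatures, which depend on the instrument; the names are illustrative.

```python
# (temperature in °C, hold time in minutes) for one cycle, as stated in the text
CYCLE = [(95, 1), (50, 4), (72, 4)]
N_CYCLES = 30
FINAL_EXTENSION_MIN = 15  # final incubation at 72°C


def program_minutes(cycle, n_cycles, final_min):
    """Total programmed hold time, excluding temperature ramps."""
    per_cycle = sum(minutes for _temp, minutes in cycle)
    return n_cycles * per_cycle + final_min


total_min = program_minutes(CYCLE, N_CYCLES, FINAL_EXTENSION_MIN)
print(f"programmed hold time: {total_min} min ({total_min / 60:.2f} h)")
```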

The SstI fragments from cosmid pCEL18h5 were cloned into pGEM-3Zf (Promega,
Madison, WI) and sequenced using the fmole DNA Cycle Sequencing System (Promega). The 
reactions were run on a Sequi-Gen II Sequencing Apparatus (Bio-Rad, Hercules, CA). Individual 
fragments were oriented relative to one another by sequencing off of cosmid pCEL18h5 using 
primers that hybridized to the 5' and 3' ends of the fragments to generate upstream and 
downstream sequence. These sequences were then matched with sequences from the individual
fragments to place them in the proper order. A very large SstI fragment (>10 kb) was further
digested with SmaI to generate smaller fragments for cloning and sequencing.

By searching the GenBank database with the sequences obtained it was possible to identify 
the various enzymatic motifs associated with the niddamycin PKS cluster and to group these 
motifs into modules (see FIG. 32) based on previous knowledge of Type I PKS organization. The 
C-6 position of the niddamycin macrolactone ring has an aldehyde derived from an ethyl side 
chain (FIG. 33). It was thus predicted that the AT of module 5 of the niddamycin cluster is 
responsible for incorporating this ethyl group into the growing chain. In addition, the carbon at
C-7 of the molecule is completely saturated, leading to the prediction that the ER and DH motifs would
also be present in module 5. These motifs were, in fact, found at the predicted region of the 
sequence. Furthermore, motifs for the preceding 4 modules were as predicted, with an inactive 
ketoreductase motif in module 4 which leaves a keto group at C-9 of the ring. Sequencing of the 
KR showed that the nucleotide binding site GXGXXG (SEQ ID NO: 27) was mutated to 
DXTXXP (SEQ ID NO: 28). The nucleotide sequence (SEQ ID NO: 29) and corresponding 
amino acid sequence (SEQ ID NO: 33) of the ethyl AT of module 5 are shown in FIG. 34 (top and 
bottom strands respectively). 

A knockout experiment was also performed on this cluster, demonstrating that this 
sequence of DNA encodes the pathway for niddamycin biosynthesis (data not shown). 

EXAMPLE 21: Construction of plasmid pEAT4

A multistep strategy was used to construct the plasmid pUC/ethAT/C6 (FIG. 35), which 
consists of the DNA encoding the NidAT5 domain flanked by approximately 2.0 kb of sequence 
upstream and downstream from the eryAT4 encoding sequences, all contained in pUC19. 
EryAT4 flanking DNA was subcloned from pAIBX85. This plasmid is a pCS5 derivative
containing 8.4 kb of Sac. erythraea DNA from an XhoI site to a BamHI site, in the eryAII gene
of the erythromycin PKS cluster. These sites correspond to bases 23211 and 31581, respectively,
of GenBank accession number M63676. The EryAT4 5'-flanking DNA was isolated by
digesting pAIBX85 with MscI and BstEII (corresponding to nucleotides 23,211 and 31,581,
respectively). The resulting 1800 bp DNA fragment was treated with the Klenow Fragment of
DNA Polymerase I, ligated into the SmaI site of pUC19, and transformed into E. coli DH5α.
Clones were selected on LB plates containing 150 µg/mL ampicillin and 50 µL of a 2% solution
of X-gal for blue/white selection. The clones were confirmed by restriction analysis, resulting in
the intermediate vector pUC/5'-flank. For convenient cloning of the NidAT5-encoding
sequences, an AvrII site was engineered at the 3' end of the 5' flanking DNA. This was
accomplished by PCR amplification from the PmlI site of the 5' flanking DNA to the BstEII site
with two oligonucleotides (SEQ ID NO:21 and SEQ ID NO:22). SEQ ID NO:22 incorporates an
AvrII site and a BamHI site at the 3' end of the 5'
flanking DNA. PCR conditions were as described in Example 20 using Sac. erythraea DNA
as template with the following changes: Taq polymerase (GIBCO BRL) was used with the
accompanying 10x buffer instead of VentR® DNA polymerase and cycling conditions were
96°C/30 sec, 55°C/30 sec, 72°C/30 sec for 25 cycles. The resulting 300 bp PCR fragment was
then digested with PmlI and BamHI, gel purified from a 1.0% agarose gel with Prep-A-Gene,
and ligated back into pUC/5'-flank digested with PmlI and BamHI to give pUC/5'flank-AvrII.
The ligation was transformed into DH5α and plated onto LB plates containing 150 µg/mL
ampicillin. Clones were confirmed by restriction analysis and DNA sequencing.

In order to clone the NidAT5-encoding DNA fragment downstream of the 5' flanking
DNA, an AvrII site was also engineered at the 5' end of the NidAT5-encoding DNA. As depicted
in FIG. 36, an AvrII site could be engineered into the NidAT5 DNA without altering the amino
acid sequence. Two PCR oligonucleotides (SEQ ID NO:23 and SEQ ID NO:24) were designed to
create an AvrII site at the 5' end and a BamHI site at the 3' end, respectively, of the
NidAT5-encoding DNA. A convenient FseI site occurs naturally at the 3' end of the NidAT5-encoding
sequence, so the resulting PCR fragment contains an FseI site just upstream of the PCR
engineered BamHI site. SEQ ID NO:23 and SEQ ID NO:24 were used in a PCR reaction with the
template pl6-2.2. This plasmid is pUC19 containing a 2.2 kb SmaI fragment from module 5 of
the niddamycin PKS cluster (see FIG. 32), which encompasses the sequences encoding NidAT5.
The resulting 1.0 kb PCR fragment was digested with AvrII and BamHI, purified from a 1.0%
agarose gel using Prep-A-Gene, and cloned into the AvrII/BamHI sites of pUC/5'flank-AvrII.
Clones were confirmed by restriction analysis and DNA sequencing, creating the intermediate
plasmid pUC/5'flank/ethAT.

The EryAT4 3'-flanking DNA was subcloned by digesting pAIBX95 with PmlI and
MscI, corresponding to nucleotides 29,231 and 31,209, respectively, from the eryAII gene
(GenBank accession number M63676). The DNA was gel purified on a 1.0% agarose gel using
Prep-A-Gene and ligated into the SmaI site of pUC19. The ligation was transformed into DH5α and
plated as described previously. Clones were confirmed by restriction analysis, resulting in the
plasmid pUC/3'-flank.

Attachment of the EryAT4 3'-flanking DNA to the NidAT5-encoding sequence was
accomplished by digesting plasmid pUC/3'-flank with FseI and BamHI, gel purifying the
fragment from a 1.0% agarose gel using Prep-A-Gene, and ligating it into pUC/5'flank/ethAT
that had been previously digested with FseI and BamHI. The ligation was transformed into DH5α
as before and clones were analyzed by restriction analysis, resulting in the intermediate plasmid
pUC/ethAT/C-6. The final step was to remove the NidAT5/flanking DNA insert from
pUC/ethAT/C-6 with EcoRI and HindIII and ligate it into the EcoRI/HindIII sites of pCS5,
resulting in the gene replacement/integration plasmid pEAT4 (FIG. 37).



EXAMPLE 22: Construction of Sac. erythraea ER720 EAT4-46



An example of a 6-desmethyl-6-ethylerythromycin A producing microorganism was
prepared by replacing the DNA fragment encoding the methylmalonyl acyltransferase domain in
module 4 of the erythromycin PKS (EryAT4) of Sac. erythraea ER720 with a newly discovered
DNA fragment encoding an ethylmalonyl acyltransferase domain (NidAT5) from S. caelestis
NRRL-2821. This was accomplished using the recombinant plasmid pEAT4, prepared as
described in Example 21. Transformation of Sac. erythraea ER720 and resolution of the
integration event were carried out according to the procedures described in Example 4 using
10 µL of a DNA solution consisting of 3 µL of pEAT4 DNA from Example 21 at about 1 µg/µL
in 7 µL of PM buffer. One colony was confirmed to be thiostrepton resistant and was inoculated
into SGGP containing thiostrepton (10 µg/mL) to isolate chromosomal DNA for Southern
analysis. Integration of the plasmid DNA into Sac. erythraea ER720 was confirmed by Southern
analysis (data not shown). Hybridization was at 65°C and the stringency wash was with 0.1 x
SSC at 65°C.



The confirmed integrant was then subcultured into 30 mL SGGP without antibiotic using
10 µL of the previous culture. After three days growth at 30°C the strain was again subcultured
into 30 mL of fresh SGGP as before and plated onto non-selective R3M plates for sporulation.
Spores were plated on R3M plates to obtain individual colonies, which were then screened for
sensitivity to thiostrepton, indicating loss of the plasmid sequence from the chromosome. Nine
thiostrepton sensitive colonies were isolated and three of them were confirmed by Southern
hybridization to have the EryAT4 replaced by NidAT5 (FIG. 38). Hybridization was at 65°C and
the stringency wash was with 0.1 x SSC at 65°C. The strain was named Sac. erythraea ER720
EAT4-46, referred to as simply EAT4-46.



EXAMPLE 23: Analysis of compounds produced by EAT4-46 

Compounds produced by strain EAT4-46, whose construction is described in 
Example 22, were characterized by TLC, bioautography and mass spectrometry. 

The cells were grown in 30 mL of SCM for 4-5 days at 30°C. The culture was
centrifuged for 10 minutes in a Sorvall GLC-4 General Laboratory Centrifuge at setting 10 to
remove cells. The resulting supernatant was adjusted to pH 9.0 by the addition of 190 µL of
NH4OH. Then 15 mL of ethyl acetate was added, the tube was vortexed for 30 seconds and then
centrifuged for 10 minutes to achieve phase separation. The organic phase was removed to
another tube, and the aqueous phase was re-extracted with 15 mL of ethyl acetate. The second
organic phase was combined with the first and dried in a Speed-Vac. The residue was taken up in
30 µL of ethyl acetate and 10 µL was spotted onto a Merck 60F-254 silica gel TLC plate. The
plate was run in a solvent containing isopropyl ether:methanol:NH4OH (75:35:2). Erythromycin
derivatives were visualized by spraying the plates with anisaldehyde:sulfuric acid:ethanol (1:1:9).
The results showed that EAT4-46 produced a compound that migrated with the same Rf as
erythromycin A produced by wild type Sac. erythraea ER720, except in much lower yield (data
not shown).

To determine the molecular mass of the compound, an ethyl acetate extract was prepared 
from a 50 mL SCM culture of EAT4-46 as described above, using a proportionate amount of 
reagents. The resulting residue was taken up in 50 µL of ethyl acetate and run on a TLC plate as
described previously, except that the plate was not sprayed with anisaldehyde. The compound of 
interest was isolated by scraping the silica resin in the vicinity of the spot and extracting the resin 
as described in Example 8. Mass spectrometric analysis revealed that the compound produced by 
the EAT4-46 strain had a mass of 734, which corresponds to the molecular ion plus a proton 
(M+H+) of erythromycin A.

In an attempt to increase substrate pools for the NidAT5 ethylmalonyl AT construction,
the EAT4-46 strain was grown in 100 mL of SCM media containing 50 mM butyric acid, pH 7.0.
The culture was grown for 4 days at 30°C and then centrifuged for 10 minutes in a Sorvall GLC-4
Centrifuge to pellet the cells. The resulting supernatant was adjusted to pH 9.0 by the addition of
600 µL of NH4OH and extracted twice with 1/2 volume of ethyl acetate as described previously.
After drying in a Speed-Vac rotary concentrator, the extracted material was taken up in 100 µL of
ethyl acetate and 10 µL was used for TLC analysis as described previously. Two spots running
near eryA were observed in the butyric acid fed culture as opposed to only one spot in SCM
media alone (FIG. 39). To determine the molecular mass of the two spots, most of the remainder
of the extract was again subjected to TLC, and the compounds in the eryA region of the plate
were isolated as described previously. Mass spectrometric analysis revealed that the two spots
had molecular masses of 734 and 748. A molecular mass of 734 corresponds to the molecular ion
plus a proton (M+H+) of erythromycin A, whereas the species of molecular mass 748 is
consistent with the molecular mass plus a proton (M+H+) of ethylerythromycin A.

EXAMPLE 24: Cloning of the NidAT6 Domain from
Streptomyces caelestis NRRL-2821 

A genomic library of Streptomyces caelestis NRRL-2821 DNA was generated and
screened with a probe specific for PKS genes as described in Example 20. From Southern
analysis of SstI digests of the positive clones (FIG. 31), some clones were selected for further
analysis. These clones were digested with SmaI and run on a 1% agarose gel for Southern
hybridization with the PKS specific probe. The analysis revealed that a second cosmid,
pCEL13f5, shared many hybridizing bands with pCEL18h5, but also contained two unique
bands of 1.9 kb and 6.0 kb (FIG. 40). This cosmid was chosen for further analysis in order to
determine the sequence of the remaining PKS genes in the niddamycin pathway. Cosmid
pCEL13f5 was digested with SstI and the fragments were ligated to pUC19. A large SstI fragment
(>10 kb) was further digested with SmaI and ligated to pUC19. The ligations were
transformed into DH5α cells and clones were selected on LB plates containing 150 µg/mL
ampicillin and 50 µL of a 2% solution of X-gal for blue/white selection. DNA from clones
containing the appropriate insert was isolated using the QIAprep Spin Plasmid Kit (QIAGEN
Inc., Chatsworth, CA). Subclones were sequenced using the ABI PRISM Dye Terminator Cycle
Sequencing Ready Reaction Kit (Perkin Elmer), and the reactions were run on a 4.75%
acrylamide, 9.3 M urea gel in an Applied Biosystems 373 DNA Sequencing System. Ordering of
the inserts and motif identification was done as described in Example 20. 

The insert in cosmid pCEL13f5 was found to be approximately 25 kb in length, and the 
5' end of the insert had about 10 kb of identical sequence with the 3' end of the insert in
pCEL18h5. Together, the two cosmids contain all of the PKS genes of the niddamycin pathway
(FIG. 32). Based on the structure of niddamycin (FIG. 33), the AT contained in module 6
(NidAT6) may utilize hydroxymalonate (tartronate) in the biosynthesis of the C-3, C-4 and O-4
positions of the macrolactone ring of niddamycin. (S. Omura et al. (J. Antibiotics 36:611-613
(1983)) have suggested that glycolate may be incorporated in the biosynthesis of the C-3, C-4
and O-4 positions of leucomycin, a closely related 16-membered macrolide.) The nucleotide
sequence of NidAT6 (top strand, SEQ ID NO:30) and its corresponding amino acid sequence 
(lower strand, SEQ ID NO:34) are shown in FIG. 41. A comparison of the amino acid sequence 
of NidAT6 with other ATs in the Swissprot database shows that NidAT6 resembles 
methylmalonyl ATs (data not shown). 

EXAMPLE 25: Construction of plasmid pUC18/NidAT6
Two PCR oligonucleotides (SEQ ID NO:25 and SEQ ID NO:26) are designed to subclone
the 1024 bp DNA fragment encoding the NidAT6 domain from the niddamycin PKS cluster and
to introduce two unique restriction sites, AvrII and NsiI, for cassette cloning. This necessitates
nucleotide changes, shown in bold in FIG. 42, at the beginning and near the end of the
NidAT6-encoding DNA sequence. The changes shown also cause the replacement of a proline codon
near the N-terminus of the NidAT6 domain with a valine codon, in order to increase the
similarity of the domain junction sequence to that found naturally for some of the AT domains of
the rapamycin PKS. (In FIG. 42, the underlined nucleotides are the wild-type sequence.) In
addition, two other restriction sites, EcoRI and BglII, are also introduced at the 5' ends of the
N-terminal and C-terminal oligonucleotides, respectively, for convenient subcloning of the
PCR-generated product. The approximately 1 kb NidAT6 domain encoding DNA is amplified using
methods described in Reagents and General Methods from cosmid pCEL13f5. The PCR product
is digested with EcoRI and BglII and subcloned into the EcoRI and BamHI sites of pUC18. The
ligation mixture is transformed into E. coli DH5α (GIBCO BRL) according to the manufacturer's
instructions and transformants are selected on LB plates containing 150 µg/mL ampicillin and 50
µL of a 2% solution of X-gal for blue/white selection. Clones are confirmed by restriction
analysis and the fidelity of the insert is confirmed by DNA sequencing. The final plasmid
construct is named pUC18/NidAT6.

Example 26: Construction of plasmid pEryAT2/NidAT6 
pEryAT2/NidAT6 is constructed using standard methods of recombinant DNA 
technology according to the schematic outlines of FIGS. 15 and 53. To make a gene 
replacement vector specific for the eryAT2 domain, the two DNA regions immediately
adjacent to eryAT2 are cloned and positioned adjacent to the DNA encoding the NidAT6
domain in order to allow homologous recombination to occur. The strategy and protocol for
constructing the intermediate plasmid containing the flanking regions, pCS5/AT2-flank, are
described in Example 6 and FIG. 13. The final step in the construction of pEryAT2/NidAT6
is to ligate the 1 kb NidAT6-encoding DNA fragment having AvrII and NsiI ends to
pCS5/AT2-flank (Example 6) cut with the same enzymes to give the gene
replacement/integration plasmid pEryAT2/NidAT6 (FIG. 43). All ligation mixes are
transformed into the intermediate host E. coli DH5α and clones are selected and
characterized as described previously.



Example 27: Construction of Sac. erythraea ER720 EryAT2/NidAT6
A 10-desmethyl-10-hydroxyerythromycin A and
12-deoxy-10-desmethyl-10-hydroxyerythromycin A producing microorganism is prepared by replacing
the DNA fragment encoding the methylmalonyl acyltransferase domain of module 2 of the erythromycin PKS
(EryAT2) of Sac. erythraea ER720 with a DNA fragment encoding a hydroxymalonyl
acyltransferase domain (NidAT6) from S. caelestis NRRL-2821. This is accomplished with the
recombinant plasmid, pEryAT2/NidAT6, prepared as described in Example 26. Transformation
of ER720 and resolution of the integration event are carried out as described in Example 4 using
10 µL of DNA solution consisting of 3 µL of pEryAT2/NidAT6 DNA at about 1 µg/µL in 7 µL
of PM buffer. Thiostrepton resistant colonies are isolated and inoculated into SGGP containing
thiostrepton (10 µg/mL) to isolate chromosomal DNA for Southern analysis. Integration of the
plasmid DNA into the ER720 chromosome is further confirmed by Southern hybridization.
Hybridization is at 65°C and the stringency wash is with 0.1 x SSC at 65°C.

Confirmed integrants are grown in SGGP without antibiotic for four days and then diluted 
1000-fold into fresh medium and grown for 4 more days. Protoplasts are then prepared and plated 
onto non-selective R3M plates to obtain individual colonies, which are screened for sensitivity to 
thiostrepton, indicating loss of the plasmid sequence from the chromosome. Thiostrepton 
sensitive colonies are then selected and these are confirmed by Southern hybridization, using 
conditions described above, to have the EryAT2 replaced by NidAT6. The strain is designated 
Sac. erythraea ER720 EryAT2/NidAT6.

Example 28: Analysis of compounds produced by Sac. erythraea ER720 EryAT2/NidAT6
Compounds produced by the recombinant Sac. erythraea strain, ER720
EryAT2/NidAT6, whose construction is described in Example 27, are characterized by TLC, 
bioassay, and mass spectrometry. 

For TLC analysis cells are grown in either SGGP or SCM medium (20 g Soytone, 15 g
Soluble Starch, 10.5 g MOPS, 1.5 g Yeast Extract and 0.1 g CaCl2 per liter of distilled H2O) for
4-5 days at 30°C. The culture is centrifuged for 5 min. to remove cells. The resulting supernatant
is removed to another tube and the pH adjusted to 9.0 by the addition of 6 µL/mL of NH4OH.
Then an equal volume of ethyl acetate is added, the liquid is mixed for 2 min. and then
centrifuged for approximately 5 min. to achieve phase separation. The organic phase is removed
to another tube, and the aqueous phase is re-extracted with a half volume of ethyl acetate. The
second organic phase is combined with the first and dried in a Speed Vac. The residue is taken up
in approximately 25 µL of ethyl acetate and 15 µL are spotted onto a Merck 60F-254 silica gel
TLC plate. The plate is run in isopropyl ether:methanol:NH4OH (75:35:2). Erythromycin
derivatives are visualized by spraying the plates with anisaldehyde:sulfuric acid:ethanol (1:1:9).
Using this reagent, two novel compounds predicted to be 10-desmethyl-10-hydroxyerythromycin
A and 12-deoxy-10-desmethyl-10-hydroxyerythromycin A are expected to appear as blue spots
running slightly slower than erythromycin A.

To determine whether the novel spots seen on TLC have the molecular mass
corresponding to the predicted 10-desmethyl-10-hydroxyerythromycin A and
12-deoxy-10-desmethyl-10-hydroxyerythromycin A, the remaining extract is further analyzed by mass
spectrometry. The two novel compounds are predicted to have masses of 736 and 720, which
correspond to the molecular ion plus a proton (M+H+) of 10-desmethyl-10-hydroxyerythromycin
A and 12-deoxy-10-desmethyl-10-hydroxyerythromycin A, respectively.
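
The predicted masses follow from exchanging the C-10 methyl group for a hydroxyl group. The sketch below is illustrative only. It assumes the (M+H+) value of 734 for erythromycin A reported in Example 23 and integer nominal masses of 15 for CH3, 17 for OH and 16 for O; the variable names are hypothetical.

```python
# Nominal-mass check for the hydroxymalonate-derived analogs (illustrative sketch).
ERY_A_MH = 734          # (M+H+) of erythromycin A, as reported in Example 23
CH3, OH, O = 15, 17, 16  # integer nominal masses

# 10-desmethyl-10-hydroxyerythromycin A: the C-10 methyl is exchanged for a hydroxyl
hydroxy = ERY_A_MH - CH3 + OH
# 12-deoxy analog: additionally lacks the oxygen of the C-12 hydroxyl
hydroxy_deoxy = hydroxy - O

print(hydroxy, hydroxy_deoxy)
```

The methyl-to-hydroxyl exchange is a net gain of two mass units, so the values line up with the predicted 736 and 720.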

EXAMPLE 29: Construction of plasmid pABX9 

The 9.6 kb BamHI-XhoI segment comprised between sequence coordinates 21.96 and
31.52 was isolated from cosmid pS1 and ligated to SalI-digested pUC19 DNA. The resulting
mixture contained the desired plasmid pABX9.

EXAMPLE 30: Construction of E. coli K12 DH5α/pABX9

Approximately 10 ng of plasmid pABX9, prepared as described in Example 29, were
transformed into E. coli K12 DH5α and a few of the resulting white ApR colonies that appeared on
the LB-agar plates containing X-gal and ampicillin were analyzed for their plasmid content. One
colony was found to carry pABX9, as verified by the observation of fragments of 3.93, 3.39, 2.01,
1.56, 0.87, and 0.48 kb in size upon agarose gel electrophoresis after SmaI digestion of the
plasmid.

EXAMPLE 31: Construction of plasmid pABX9DN

Plasmid pABX9, isolated from E. coli K12 DH5α/pABX9, was digested with NcoI and
then treated with T4 DNA ligase. The resulting mixture contained the desired plasmid
pABX9DN.



EXAMPLE 32: Construction of E. coli K12 DH5α/pABX9DN

Approximately 10 ng of plasmid pABX9DN, prepared as described in Example 31, were
transformed into E. coli K12 DH5α and a few of the resulting white ApR colonies that appeared
on the LB-agar plates containing X-gal and ampicillin were analyzed for their plasmid content.
Colonies carrying pABX9DN exhibited a single NcoI fragment of 11.5 kb visible by agarose gel
electrophoresis, confirming that the 813 bp NcoI-NcoI fragment from pABX9 has been deleted in
pABX9DN.
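
The fragment sizes reported for pABX9 and pABX9DN can be cross-checked against the expected plasmid sizes. The sketch below is illustrative only. It assumes the published pUC19 size of 2,686 bp, takes the insert length from the sequence coordinates given in Example 29, and uses the SmaI fragment sizes from Example 30; the variable names are hypothetical.

```python
# Size bookkeeping for pABX9 and pABX9DN (illustrative sketch).
PUC19_BP = 2686                              # ASSUMPTION: published size of pUC19
INSERT_BP = round((31.52 - 21.96) * 1000)    # kb coordinates from Example 29
SMAI_FRAGMENTS_KB = [3.93, 3.39, 2.01, 1.56, 0.87, 0.48]  # from Example 30
NCOI_DELETION_BP = 813                       # NcoI-NcoI fragment removed

pabx9_bp = PUC19_BP + INSERT_BP              # expected size of pABX9
smai_sum_kb = sum(SMAI_FRAGMENTS_KB)         # size implied by the SmaI digest
pabx9dn_bp = pabx9_bp - NCOI_DELETION_BP     # expected size of pABX9DN

print(f"pABX9 expected: {pabx9_bp / 1000:.2f} kb, SmaI sum: {smai_sum_kb:.2f} kb")
print(f"pABX9DN expected: {pabx9dn_bp / 1000:.2f} kb")
print(f"deletion is a multiple of 3 bp: {NCOI_DELETION_BP % 3 == 0}")
```

Under these assumptions the SmaI fragments sum to within about 0.1 kb of the expected pABX9 size, and the expected pABX9DN size is close to the single 11.5 kb NcoI fragment observed, within the resolution of an agarose gel.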

EXAMPLE 33: Construction of plasmid pABX95DN 

Plasmid pABX9DN was digested with EcoRI and HindIII and ligated to pWHM3 digested
with the same two enzymes. The resulting mixture contained the desired plasmid pABX95DN.

EXAMPLE 34: Construction of E. coli K12 DH5α/pABX95DN

Approximately 10 ng of plasmid pABX95DN, prepared as described in Example 33, were
transformed into E. coli K12 DH5α and a few of the resulting white ApR colonies that appeared
on the LB-agar plates containing X-gal and ampicillin were analyzed for their plasmid content.
Colonies carrying pABX95DN exhibited fragments of 8.8 and 7.2 kb visible in agarose gels after
EcoRI and HindIII digestion.

EXAMPLE 35: Construction of Sac. erythraea AKR5 carrying the
eryAKR5 allele by gene replacement
Approximately 1 mg of plasmid pABX95DN, isolated from E. coli K12
DH5α/pABX95DN, was transformed into Sac. erythraea NRRL 2338 and stable ThR colonies
were isolated. Serial dilutions of one of these colonies were screened for the loss of the antibiotic
resistance marker and total DNA from 5 ThS colonies as well as from untransformed Sac.
erythraea NRRL 2338 was digested with SstI and analyzed by Southern hybridization employing
the 0.8 kb SalI fragment between sequence coordinates 24.26 and 25.06 (from pABX9) as probe.
Whereas NRRL 2338 showed one SstI band of 3.7 kb that hybridized to the probe, samples from
four of the ThS strains exhibited an SstI-hybridizing band of 6.1 kb indicating the presence of the
mutant allele. One of these colonies was kept and designated strain AKR5. It carries a deletion of
813 bp in the KR5 segment of eryA; the mutant allele is designated eryAKR5.

EXAMPLE 36: Isolation, purification and properties of 5-oxo-5,6-dideoxy-3-α-mycarosyl
erythronolide B from Sac. erythraea AKR5

A 10-liter fermentation of Sac. erythraea AKR5 carrying the eryAKR5 allele was carried out in a
Biolafitte fermentor using SNC Media. The fermentor was inoculated with 100 mL of a 3 day old
seed. The pO2 was initially 80 ppm and the temperature was maintained at 32°C. The pH was
controlled to 7.0±0.2 by addition of propionic acid or potassium hydroxide as needed. At harvest
(3 days), the whole broth was extracted three times with 4-liter portions of ethyl acetate. The
combined extracts were concentrated under reduced pressure and the residue was
chromatographed on a column (50 x 5 cm) of silica gel packed and loaded in toluene and eluted
with a stepwise gradient of increasing concentration of isopropanol in toluene. Fractions were
analyzed by TLC and spots were detected by spraying with anisaldehyde-sulfuric acid spray
reagent and heating. A major component giving blue colored spots eluted with approximately 7%
isopropanol. Fractions containing this material were combined and concentrated to a residue (800
mg). This was further chromatographed on a column (100 x 3 cm) of Sephadex LH-20 in
chloroform-heptane-ethanol, 10:10:1, v/v/v. Fractions were analyzed as above; early fractions
(9-13) yielded 5,6-dideoxy-3-α-mycarosyl-5-oxoerythronolide B (45 mg), which was crystallized from
a heptane/ethyl acetate mixture to mp 163°-164°C.
Structure was determined by single crystal X-ray diffraction.

Later fractions (15-17) yielded 5,6-dideoxy-5-oxoerythronolide B (10 mg) and still later
fractions yielded 5,6-dideoxy-6,6a-epoxy-5-oxoerythronolide B (2.8 mg).

EXAMPLE 37: Construction of plasmid pALeryAKR2 

The 1.3 kb DNA segment comprised between coordinates 8.63-9.93 (fragment 1) is
amplified by PCR employing two oligodeoxynucleotides, 1a (5'-
GGGAGCATGCTCTCGGTGCGCGGCGGCCGC-3') (SEQ ID NO:35) and 1b (5'-
GCCCTGCAGCGCGTACTCCGAGGTGGCGGT-3') (SEQ ID NO:36). Similarly, the 1.3 kb
DNA segment between coordinates 9.99-11.26 (fragment 2) is PCR-amplified employing primers
2a (5'-TGGTCTGCAGGCGAGGCCGGACACCGAGG-3') (SEQ ID NO:37) and 2b (5'-
GGAAGAAGTCAAAGTTCCTCGGTCCCTTCT-3') (SEQ ID NO:38). After digestions with
SphI+PstI (fragment 1) and PstI+EcoRI (fragment 2), the two fragments are ligated to
EcoRI+SphI-digested pWHM3. The resultant mixture contains the desired plasmid
pALeryAKR2.

EXAMPLE 38: Construction of E. coli K12 DH5 a/p ALeryAKR2 

Approximately 10 ng of plasmid pALeryAKR2, prepared as described in Example 37, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAKR2, 9.8 kb in size, and carrying a 2.6 kb EcoRI-SphI insert with 
an internal PstI site, is verified by SalI digestion (fragments at 2.91, 2.21, 1.61, 1.42, 1.08, 0.29, 
0.12 and 0.10 kb are released, visible by agarose gel electrophoresis). pALeryAKR2 contains an 
in-frame deletion of 102 base pairs of the corresponding segment of the wild type eryA 
chromosomal DNA. The cloned segment in pALeryAKR2 is designated the eryAKR2 allele. 
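
Two simple consistency checks can be applied to the figures in this example: the SalI fragments should sum to approximately the plasmid size, and an in-frame deletion must span a multiple of three base pairs. The sketch below illustrates both; the function names and the 0.1 kb tolerance are assumptions made for illustration and are not part of the described procedure.

```python
# Fragment sizes (kb), plasmid size and deletion length as quoted in Example 38.
sali_fragments_kb = [2.91, 2.21, 1.61, 1.42, 1.08, 0.29, 0.12, 0.10]
plasmid_kb = 9.8
deletion_bp = 102

def digest_total(fragments_kb):
    """Sum of restriction fragments; for a complete digest of a circular
    plasmid this should approximate the plasmid size."""
    return round(sum(fragments_kb), 2)

def is_in_frame(length_bp):
    """A deletion preserves the reading frame when its length is a multiple of 3."""
    return length_bp % 3 == 0

total = digest_total(sali_fragments_kb)
within_tolerance = abs(total - plasmid_kb) < 0.1   # tolerance is an assumed value
codons_removed = deletion_bp // 3

print(total, within_tolerance, is_in_frame(deletion_bp), codons_removed)
```

Under these assumptions the fragments total 9.74 kb against the stated 9.8 kb, and the 102 bp deletion corresponds to 34 codons.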

EXAMPLE 39: Construction of Sac. erythraea AKR2 carrying the 
eryAKR2 allele by gene replacement 

Approximately 1 mg of plasmid pALeryAKR2, isolated from E. coli K12 
DH5α/pALeryAKR2, is transformed into Sac. erythraea protoplasts and the stable ThR colonies are 
isolated. Serial dilutions of one of these colonies are screened for loss of the antibiotic resistance 
marker, and six ThS colonies are analyzed for their genotype by Southern hybridization. Total 
DNA from the six ThS colonies and from untransformed Sac. erythraea NRRL2338 is digested 
with PstI and with SalI and is then examined by Southern hybridization using the 2.6 kb EcoRI- 
SphI insert from pALeryAKR2 as probe. Whereas NRRL2338 contains a 39 kb PstI hybridizing 
band, colonies in which the mutation in KR2 has been introduced (strain AKR2) exhibit two bands 
of approximately equal intensity, one at 27 kb and the other at 12 kb. The SalI digest, with bands 
at 1.04, 0.75, 0.29, 0.12 and 0.10 kb common to NRRL2338 and AKR2, but with the 1.16 kb band 
in NRRL2338 replaced by the 1.06 kb band in AKR2, confirms that the only change introduced 
into strain AKR2 is the deletion of the 102 bp segment from KR2, resulting in a strain carrying the 
eryAKR2 allele. 
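
The band pattern reported here can be checked arithmetically. The sketch below assumes that the two PstI bands in strain AKR2 arise from a single new PstI site (presumably the one at the deletion junction) splitting the wild type band, and that the SalI band spanning the deletion shrinks by exactly the deleted length; the variable names and the 0.5 kb tolerance are illustrative assumptions.

```python
# Band sizes (kb) and deletion length (bp) as quoted in Example 39.
wild_type_psti_kb = 39.0
mutant_psti_kb = (27.0, 12.0)
wild_type_sali_kb = 1.16
deletion_kb = 102 / 1000

# A single new PstI site splits the wild type band into two pieces whose
# sizes should add up to the original band (tolerance is an assumed value).
psti_consistent = abs(sum(mutant_psti_kb) - wild_type_psti_kb) < 0.5

# The SalI band that spans the deletion should shrink by the deleted length.
expected_sali_kb = round(wild_type_sali_kb - deletion_kb, 2)

print(psti_consistent, expected_sali_kb)
```

The expected SalI band of 1.06 kb agrees with the band reported for AKR2.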

EXAMPLE 40: Isolation and purification of 11-deoxy-11-oxoerythromycin A 
The fermentation beer of strain AKR2, cooled to 4°C, is adjusted to pH 8.0 and is 
extracted sequentially with three equal volumes of methylene chloride. The combined 
methylene chloride extracts are concentrated to an oily residue and partitioned between heptane 
and methanol. The methanol layer is removed, washed once with heptane and concentrated to a 
residue. The residue is digested in methylene chloride and washed once with potassium 
phosphate buffer pH 7.8 and once with water. The methylene chloride layer is concentrated to 
a residue and digested in the lower phase (1:1:1, v/v/v) of a carbon tetrachloride, methanol, 
aqueous phosphate buffer (0.05M, pH 7.0) system and chromatographed on an Ito Coil Planet 
Centrifuge in the same system. Fractions containing the desired 11-oxo-11-deoxyerythromycin 
A are combined, concentrated, digested in methylene chloride, washed well with water and 
concentrated on a rotary evaporator under reduced pressure to yield 11-deoxy-11- 
oxoerythromycin A as an off-white solid L53-18A. 11-Deoxy-11-oxoerythromycin A is 
dissolved in tetrahydrofuran and the solution is diluted with an equal volume of water. This is 
then acidified to pH 4.0 and allowed to stand at room temperature for 4 hours. The pH is 
adjusted to 9.0 and the solution is diluted with an equal volume of water and extracted with two 
volumes of methylene chloride. The combined methylene chloride extracts are evaporated to 
dryness under reduced pressure to yield antibiotic L53-18A as a white solid. 

EXAMPLE 41: Construction of plasmid pALeryADH4 
Primers 3a (GCGCGAGCTCGACGACCAGGGCGGCATGGT) (SEQ ID NO:39) and 3b 
(GGTGGCATGCTGCGACCACTGCGCGTCGGC) (SEQ ID NO:40) are used to PCR-amplify 
the 1.05 kb eryA segment of the Sac. erythraea chromosome between sequence coordinates 
18.47-20.07 (fragment 3), and primers 4a (AGCTGCATGCTCTGGACTGGGGACGGCTAG) 
(SEQ ID NO:41) and 4b (CGCGGGATCCCAGCTCCCACGCCGATACCG) (SEQ ID NO:42) 
are used to amplify the 1.35 kb segment between sequence coordinates 20.58-21.96 (fragment 4) 
as described in Example 29. Fragments 3 and 4, after digestion with SstI+SphI and with 
SphI+BamHI, respectively, are ligated to SstI-, BamHI-digested pWHM3. The resulting ligation 
mixture contains the desired plasmid pALeryADH4. 
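
Each primer in this example carries the recognition sequence of the enzyme later used to cut the corresponding fragment end. The sketch below checks this by plain substring search, using the standard recognition sequences for SstI (GAGCTC), SphI (GCATGC) and BamHI (GGATCC); the dictionary and function names are hypothetical and serve only as an illustration.

```python
# Recognition sequences of the enzymes named in Example 41.
SITES = {"SstI": "GAGCTC", "SphI": "GCATGC", "BamHI": "GGATCC"}

# Primer sequences copied from the text (SEQ ID NO:39-42).
PRIMERS = {
    "3a": "GCGCGAGCTCGACGACCAGGGCGGCATGGT",
    "3b": "GGTGGCATGCTGCGACCACTGCGCGTCGGC",
    "4a": "AGCTGCATGCTCTGGACTGGGGACGGCTAG",
    "4b": "CGCGGGATCCCAGCTCCCACGCCGATACCG",
}

def sites_in(primer):
    """Names of the enzymes whose recognition sequence occurs in the primer."""
    return [name for name, site in SITES.items() if site in primer]

for name, seq in PRIMERS.items():
    print(name, len(seq), sites_in(seq))
```

Primer 3a carries the SstI site, primers 3b and 4a each carry an SphI site, and primer 4b carries the BamHI site, which matches the SstI+SphI and SphI+BamHI digestions applied to fragments 3 and 4.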

EXAMPLE 42: Construction of E. coli K12 DH5α/pALeryADH4 

Approximately 10 ng of pALeryADH4, prepared as described in Example 41, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryADH4, 9.6 kb in size, is verified by SphI+EcoRI digestion (fragments 
at 7.2, 1.35 and 1.05 kb are released). pALeryADH4 carries a 498 base pair in-frame deletion of 
the corresponding segment of the wild type eryA DNA. The cloned segment in pALeryADH4 is 
designated the eryADH4 allele. 

EXAMPLE 43: Construction of Sac. erythraea ADH4 carrying 
the eryADH4 allele by gene replacement 

Approximately 1 mg of plasmid pALeryADH4, isolated from E. coli K12 
DH5α/pALeryADH4, is used for transformation into Sac. erythraea protoplasts and stable ThR 
colonies are isolated. Serial dilutions of one of these colonies are screened for loss of the 
antibiotic resistance marker and six ThS colonies are analyzed for their genotype by Southern 
hybridization. Total DNA from the six ThS colonies and from untransformed Sac. erythraea 
NRRL2338 is digested with SphI and with SstI and examined by Southern hybridization using the 
2.4 kb SstI-BamHI insert from pALeryADH4 as probe. Strains in which the wild type allele has 
been replaced by the mutated copy show two SphI bands, one at 13.5 kb and the other at 12.4 kb, 
whereas the wild type strain exhibits a single band at 26 kb. The SstI pattern, with the 2.9 kb band 
from NRRL2338 being replaced in ADH4 by a 2.5 kb band, confirms that the 487 bp deletion 
created in plasmid pALeryADH4 has been transferred into the chromosome of ADH4. Strains 
that carry the eryADH4 allele in place of the wild type sequence are designated Sac. erythraea 
ADH4. 

EXAMPLE 44: Isolation and characterization of 7-hydroxyerythromycin A 
and 6-deoxy-7-hydroxyerythromycin A 

The fermentation beer of strain ADH4 is cooled to 4°C and the pH is adjusted to 5.0. The 
mixture is extracted once with an equal volume of methylene chloride. The pH of the aqueous 
layer is readjusted to 9.0 and two further methylene chloride extractions are carried out. These two 
extracts are combined, washed with water and concentrated to a residue. This is digested in 10 ml 
of the upper phase of a (3:7:5, v/v/v) mixture of hexane, ethyl acetate, aqueous phosphate buffer 
(0.05M, pH 7.5) and chromatographed on an Ito Coil Planet Centrifuge in the same system. 
Fractions containing the desired 7-hydroxyerythromycin A are combined, concentrated, and 
partitioned between methylene chloride and dilute (pH 9.5) ammonium hydroxide solution. 
Fractions containing the desired 6-deoxy-7-hydroxyerythromycin A are combined, concentrated, 
and partitioned between methylene chloride and dilute (pH 9.5) ammonium hydroxide solution. 
The methylene chloride layers are washed with water and then concentrated to yield the desired 
7-hydroxyerythromycin A and 6-deoxy-7-hydroxyerythromycin A as white foams. 

EXAMPLE 45: Construction of plasmid pALeryAM4.1 
Primers 9a (GCGCCGAATTCTCGAGACGGCGTGGGAGGCA) (SEQ ID NO: 43) and 
9b (TTGCGGTACCAGTAGGAGGCGTCCATCGCG) (SEQ ID NO: 44) are employed to PCR- 
amplify the 2.0 kb eryA segment between sequence coordinates 17.35-19.38 (fragment 9). After 
digestion with EcoRI+KpnI, fragment 9 is ligated to pUC19 cut with the same two enzymes. The 
resulting mixture contains the desired plasmid pALeryAM4.1. 

EXAMPLE 46: Construction of E. coli K12 DH5α/pALeryAM4.1 
Approximately 10 ng of pALeryAM4.1, prepared as described in Example 45, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM4.1, 4.7 kb in size, is verified by digestion with SalI (fragments of 
2.8, 0.85, 0.53, 0.27 and 0.22 kb are observed by agarose gel electrophoresis). 

EXAMPLE 47: Construction of plasmid pALeryAM4.2 
Primers 10a (GCTGGGATCCCGCGGCGCGGGTTGCAGCAC) (SEQ ID NO: 45) and 
10b (CGGAACTCGGTGAGCATGCCGGGACTGCTC) (SEQ ID NO: 46) are used to PCR- 
amplify the 2.1 kb eryA segment between sequence coordinates 21.94-24.00 (fragment 10). The 
2.6 kb fragment KpnI(96)-BamHI(102) from cosmid clone pR1, and fragment 10 cut with 
BamHI+SphI, are ligated to pALeryAM4.1 cut with KpnI+SphI. The resulting mixture contains 
the desired plasmid pALeryAM4.2. 

EXAMPLE 48: Construction of E. coli K12 DH5α/pALeryAM4.2 

Approximately 10 ng of pALeryAM4.2, prepared as described in Example 47, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM4.2, 9.3 kb in size, is verified by digestion with XhoI+SphI (to 
ensure that the entire 6.65 kb insert is released) and with SalI (fragments of 2.8, 1.82, 1.09, 
0.94, 0.85, 0.75, 0.45, 0.27, 0.22 and 0.13 kb are observed by agarose gel electrophoresis). 
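
The stated size of pALeryAM4.2 can be cross-checked from its parts. The sketch below adds the three cloned pieces described in Examples 45 and 47, adds the stated insert to the vector, and sums the SalI fragments. The pUC19 length of 2,686 bp is the published size of that vector and is not taken from this text; the variable names are illustrative only, and the nominal piece sizes are rounded values, so only approximate agreement is expected.

```python
# Sizes (kb) quoted in Examples 45-48. PUC19_KB is the published vector
# length (2,686 bp), an outside value not stated in the text.
PUC19_KB = 2.686
piece_kb = {
    "fragment 9 (EcoRI-KpnI)": 2.0,
    "KpnI(96)-BamHI(102) from pR1": 2.6,
    "fragment 10 (BamHI-SphI)": 2.1,
}

# Nominal insert assembled from the three pieces versus the stated 6.65 kb.
approx_insert_kb = round(sum(piece_kb.values()), 2)
stated_insert_kb = 6.65

# Vector plus stated insert should approximate the stated 9.3 kb plasmid.
predicted_plasmid_kb = round(PUC19_KB + stated_insert_kb, 1)

# A complete SalI digest should also sum to roughly the plasmid size.
sali_fragments_kb = [2.8, 1.82, 1.09, 0.94, 0.85, 0.75, 0.45, 0.27, 0.22, 0.13]
sali_total_kb = round(sum(sali_fragments_kb), 2)

print(approx_insert_kb, predicted_plasmid_kb, sali_total_kb)
```

The three pieces give a nominal 6.7 kb insert, the vector plus the stated insert gives 9.3 kb, and the SalI fragments total 9.32 kb.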

EXAMPLE 49: Construction of plasmid pALeryAM1 

The 2.9 kb SmaI(4)-SmaI(20) fragment from cosmid clone pR1 is ligated to pUC12 cut 
with SmaI. The resulting mixture contains plasmid pALeryAM1. 

EXAMPLE 50: Construction of E. coli K12 DH5α/pALeryAM1 

Approximately 10 ng of pALeryAM1, prepared as described in Example 49, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM1, 5.6 kb in size, is verified by digestion with SmaI (the 2.9 kb 
insert is released) and with SphI, with release of one 4.4 kb and one 1.07 kb band. Both orientations 
of the insert in plasmid pALeryAM1 are useful. 



EXAMPLE 51: Construction of plasmid pALeryAM4.3 

Plasmid pALeryAM1 is cut with XhoI to completion and partially with SphI, and the resulting 
5.25 kb band, isolated from an agarose gel, is ligated to the 6.65 kb insert released from 
pALeryAM4.2 by XhoI+SphI digestion. The resulting mixture contains the desired plasmid 
pALeryAM4.3. 

EXAMPLE 52: Construction of E. coli K12 DH5α/pALeryAM4.3 
Approximately 10 ng of pALeryAM4.3, prepared as described in Example 51, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM4.3, 11.9 kb in size, is verified by XhoI+SphI digestion (fragments 
of 6.65 and 5.25 kb are visible by agarose gel electrophoresis). Plasmid pALeryAM4.3 carries the 
entire eryA module 4 inserted into the KS region of module 1. The cloned insert in 
pALeryAM4.3 is designated the eryAM412 allele. 

EXAMPLE 53: Construction of plasmid pALeryAM4.4 

Plasmid pALeryAM4.3 is cut with EcoRI+HindIII, and the resulting 9.2 kb band, 
recovered from an agarose gel, is ligated to pWHM4 cut with the same two enzymes. The 
resulting mixture contains the desired plasmid pALeryAM4.4. 

EXAMPLE 54: Construction of E. coli K12 DH5α/pALeryAM4.4 
Approximately 10 ng of pALeryAM4.4, prepared as described in Example 53, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM4.4, 16.5 kb in size, is verified by EcoRI+HindIII digestion, with 
fragments of 9.2 and 7.3 kb released. Plasmid pALeryAM4.4 carries the eryAM412 allele on the 
Sac. erythraea multicopy vector pWHM4. 

EXAMPLE 55: Construction of Sac. erythraea AM412 carrying 
the eryAM412 allele by gene conversion 

Approximately 1 mg of plasmid pALeryAM4.4, isolated from E. coli K12 
DH5α/pALeryAM4.4, is used for transformation into Sac. erythraea strain AKS1 protoplasts. A 
few hundred transformants are screened for antibiotic production by the agar-plug assay, and one 
of the colonies found to produce antimicrobial activity is cured of pALeryAM4.4 by protoplast 
formation and regeneration as described in General Methods. Total DNA from six antibiotic- 
producing ThS colonies (strain AM412) and from strain AKS1 is digested with SphI and with 
XhoI and the resulting Southern blot is hybridized first to the 2.9 kb insert from pALeryAM1, and 
then to the 2.9 kb SstI(95)-SstI(101) fragment from plasmid pALeryAM4.2. With the first probe, 
the SphI band at 0.8 kb in strain AKS1 is seen to be replaced by a 7.5 kb band in strain AM412, 
whereas the other two bands at 2.4 kb and 5.2 kb are unaffected. In the XhoI digest, the AKS1 
band at 2.9 kb is replaced by a 9.6 kb band in AM412, with the other band at 5.2 kb conserved in 
both strains. Using the SstI(95)-SstI(101) fragment as probe, strain AKS1 exhibits one band at 
25.5 kb and one at 17.9 kb in the SphI and XhoI digests, respectively, whereas, in addition to these 
bands, strain AM412 shows one SphI band at 7.5 kb and one XhoI band at 9.6 kb. In this way, it 
is established that the eryAKS1 allele has been converted into the eryAM412 allele in strain 
AM412. 

EXAMPLE 56: Isolation and purification of 14-(1-propyl)erythromycin A 

At harvest the fermentation is adjusted to pH 9.5 and extracted twice with equal volumes 
of methylene chloride. The combined extracts are washed once with water and concentrated to an 
oily residue. This is partitioned in a heptane, methanol, water (5:5:1, v/v/v) system and the lower 
layer is washed once with heptane and then concentrated to a semisolid residue. This is digested 
in methanol and chromatographed over a column of Sephadex LH-20 in methanol. Fractions are 
tested for bioactivity in an agar diffusion assay on plates seeded with the macrolide-sensitive 
strain Staphylococcus aureus ThR. Active fractions are combined and further purified by 
chromatography over silica gel with a chloroform:methanol gradient containing 0.1% triethylamine. 
Fractions containing the desired 14-(1-propyl)erythromycin A are combined and concentrated to 
yield the product as a white solid. 

EXAMPLE 57: Construction of plasmid pALeryAM5.1 

The 4.7 kb eryA fragment between sequence coordinates 23.65-28.36 (fragment 11) is 
PCR-amplified employing primers 11a (ATGCTCGAGATCTCGTGGGAGGCGCTGGA) (SEQ 
ID NO:47) and 11b (AGAACTCGGTGAGCATGCCCGGGCCCGCCA) (SEQ ID NO:48). 
Fragment 11, after digestion with XhoI+SphI, is ligated to the 5.25 kb fragment resulting from 
complete XhoI and partial SphI digestions of pALeryAM1, as in Example 51. The resulting 
mixture contains the desired plasmid pALeryAM5.1. 

EXAMPLE 58: Construction of E. coli K12 DH5α/pALeryAM5.1 

Approximately 10 ng of pALeryAM5.1, prepared as described in Example 57, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM5.1, 9.95 kb in size, is verified by SphI+XhoI digestion, with 
fragments of 5.25 and 4.7 kb released, and by SmaI digestion, where fragments of 3.39, 2.68 and 
1.94 (doublet) kb are observed. Plasmid pALeryAM5.1 carries the entire eryA module 5 inserted 
into the 3-ketoacyl ACP synthase region of module 1. The cloned insert in plasmid 
pALeryAM5.1 is designated the eryAM512 allele. 

EXAMPLE 59: Construction of plasmid pALeryAM5.2 

Plasmid pALeryAM5.1 is cut with EcoRI+HindIII and the resulting 6.3 kb fragment, 
recovered from an agarose gel, is ligated to pWHM4 cut with the same two enzymes. The 
resulting mixture contains the desired plasmid pALeryAM5.2. 

EXAMPLE 60: Construction of E. coli K12 DH5α/pALeryAM5.2 

Approximately 10 ng of pALeryAM5.2, prepared as described in Example 59, are 
transformed into E. coli K12 DH5α, and a few of the resulting white ApR colonies that appear on 
the LB-agar plates containing X-gal and ampicillin are analyzed for their plasmid content. The 
identity of plasmid pALeryAM5.2, 13.6 kb in size, is verified by digestion with EcoRI+HindIII, 
with fragments of 7.3 and 6.3 kb released. Plasmid pALeryAM5.2 contains the eryAM512 allele 
on the Sac. erythraea multicopy vector pWHM4. 

EXAMPLE 61: Construction of Sac. erythraea AM512 carrying the 
eryAM512 allele by gene conversion 

Approximately 1 mg of plasmid pALeryAM5.2, isolated from E. coli K12 
DH5α/pALeryAM5.2, is used for transformation into Sac. erythraea strain AKS1 protoplasts. A 
few hundred transformants are screened for antibiotic production by the agar-plug assay, and one 
of the colonies found to produce antimicrobial activity is cured of pALeryAM5.2 by protoplast 
formation and regeneration as described in General Methods. Total DNA from six antibiotic- 
producing ThS colonies (strain AM512) and from strain AKS1 is digested with SphI and with 
XhoI and the resulting Southern blot is hybridized first to the 2.9 kb insert from pALeryAM1, and 
then to the 0.8 kb NcoI(119)-NcoI(123) fragment from plasmid pALeryAM5.1. With the first 
probe, the SphI band at 0.8 kb in strain AKS1 is replaced by a 5.5 kb band in strain AM512, 
whereas the other two bands at 2.4 kb and 5.2 kb are unaffected. In the XhoI digest, the AKS1 
band at 2.9 kb is replaced by a 7.6 kb band in AM512, with the other band 
at 5.2 kb conserved in both strains. Using the NcoI(119)-NcoI(123) fragment as probe, strain 
AKS1 exhibits one band at 25.5 kb and one at 17.9 kb in the SphI and XhoI digests, respectively, 
whereas, in addition to these bands, strain AM512 shows one SphI band at 5.5 kb and one XhoI 
band at 7.6 kb. In this way, it is established that the eryAKS1 allele has been converted into the 
eryAM512 allele in strain AM512. 
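
The band shifts reported for strains AM412 and AM512 can be compared with the sizes of the inserted module fragments. The sketch below is a rough consistency check under the simplifying assumption that each parental AKS1 band grows by the full length of the inserted XhoI-SphI fragment; it ignores any short segment removed at the insertion site. The dictionary and function names are hypothetical.

```python
# Values (kb) quoted in Examples 51, 55, 57 and 61.
AKS1_BANDS = {"SphI": 0.8, "XhoI": 2.9}   # bands seen with the pALeryAM1 probe
INSERTS = {"AM412": 6.65, "AM512": 4.7}   # XhoI-SphI module fragments
OBSERVED = {
    "AM412": {"SphI": 7.5, "XhoI": 9.6},
    "AM512": {"SphI": 5.5, "XhoI": 7.6},
}

def predicted_band(parent_kb, insert_kb):
    """Simplifying assumption: the parental band grows by the inserted length."""
    return round(parent_kb + insert_kb, 2)

for strain, insert_kb in INSERTS.items():
    for enzyme, parent_kb in AKS1_BANDS.items():
        predicted = predicted_band(parent_kb, insert_kb)
        print(strain, enzyme, predicted, OBSERVED[strain][enzyme])
```

Under this assumption the predicted bands (7.45 and 9.55 kb for AM412; 5.5 and 7.6 kb for AM512) lie within about 0.05 kb of the reported values.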

EXAMPLE 62: Isolation and purification of 14[1(1-hydroxypropyl)]erythromycin A 
At harvest the pH of the fermentation of AM512 is adjusted to 9.5 and the mixture is 
extracted twice with equal volumes of ethyl acetate. The combined ethyl acetate extracts are 
washed with water, dried and partitioned in a heptane, methanol, water (5:5:1, v/v/v) system. The 
lower (methanolic) phase is washed with an equal volume of heptane and is concentrated to a 
residue. This is chromatographed on a Sephadex LH-20 column in methanol and fractions 
containing the desired 14[1(1-hydroxypropyl)]erythromycin A are concentrated and further 
purified by chromatography on an Ito Coil Planet Centrifuge in a system consisting of n-heptane, 
benzene, acetone, isopropanol, 0.65M, pH 7.0 aqueous potassium phosphate buffer (5:10:2:3:5, 
v/v/v/v/v). Fractions containing the desired product are concentrated to a solid residue and 
partitioned between methylene chloride and dilute (pH 9.5) ammonium hydroxide. The organic 
layer is washed with water and concentrated to yield 14[1(1-hydroxypropyl)]erythromycin A 
as a white solid." 